Dataset statistics
| Number of variables | 45 |
|---|---|
| Number of observations | 407684 |
| Missing cells | 3871070 |
| Missing cells (%) | 21.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 159.2 MiB |
| Average record size in memory | 409.5 B |
Variable types
| Numeric | 10 |
|---|---|
| Categorical | 34 |
| Boolean | 1 |
ori has constant value "CA0371100" | Constant |
agency has constant value "SD" | Constant |
gendnc_code has constant value "5.0" | Constant |
id has a high cardinality: 407684 distinct values | High cardinality |
date has a high cardinality: 912 distinct values | High cardinality |
time has a high cardinality: 77771 distinct values | High cardinality |
inters has a high cardinality: 15939 distinct values | High cardinality |
street has a high cardinality: 44668 distinct values | High cardinality |
hw_exit has a high cardinality: 2211 distinct values | High cardinality |
school_name has a high cardinality: 85 distinct values | High cardinality |
beat_name has a high cardinality: 127 distinct values | High cardinality |
disability has a high cardinality: 134 distinct values | High cardinality |
reason_text has a high cardinality: 1697 distinct values | High cardinality |
reason_detail has a high cardinality: 282 distinct values | High cardinality |
reason_exp has a high cardinality: 183583 distinct values | High cardinality |
search_basis has a high cardinality: 721 distinct values | High cardinality |
search_basis_exp has a high cardinality: 28990 distinct values | High cardinality |
prop_type has a high cardinality: 490 distinct values | High cardinality |
cont has a high cardinality: 669 distinct values | High cardinality |
actions has a high cardinality: 11672 distinct values | High cardinality |
act_consent has a high cardinality: 335 distinct values | High cardinality |
is_serv is highly imbalanced (50.6%) | Imbalance |
assign_words is highly imbalanced (84.0%) | Imbalance |
is_school is highly imbalanced (99.1%) | Imbalance |
city is highly imbalanced (96.7%) | Imbalance |
is_student is highly imbalanced (99.5%) | Imbalance |
lim_eng is highly imbalanced (86.4%) | Imbalance |
gender_words is highly imbalanced (56.8%) | Imbalance |
is_gendnc is highly imbalanced (99.5%) | Imbalance |
gender_code is highly imbalanced (62.7%) | Imbalance |
lgbt is highly imbalanced (82.4%) | Imbalance |
disability is highly imbalanced (95.0%) | Imbalance |
reason_words is highly imbalanced (56.3%) | Imbalance |
reason_detail is highly imbalanced (65.7%) | Imbalance |
search_basis is highly imbalanced (69.5%) | Imbalance |
cont is highly imbalanced (91.4%) | Imbalance |
actions is highly imbalanced (70.9%) | Imbalance |
act_consent is highly imbalanced (64.6%) | Imbalance |
inters has 366868 (90.0%) missing values | Missing |
block has 43330 (10.6%) missing values | Missing |
ldmk has 407643 (> 99.9%) missing values | Missing |
street has 16834 (4.1%) missing values | Missing |
hw_exit has 404618 (99.2%) missing values | Missing |
school_name has 407362 (99.9%) missing values | Missing |
gendnc_code has 407507 (> 99.9%) missing values | Missing |
reasonid has 18844 (4.6%) missing values | Missing |
reason_text has 18844 (4.6%) missing values | Missing |
reason_detail has 18838 (4.6%) missing values | Missing |
search_basis has 321160 (78.8%) missing values | Missing |
search_basis_exp has 344258 (84.4%) missing values | Missing |
seiz_basis has 398568 (97.8%) missing values | Missing |
prop_type has 398568 (97.8%) missing values | Missing |
act_consent has 297641 (73.0%) missing values | Missing |
block is highly skewed (γ1 = 254.9436976) | Skewed |
id is uniformly distributed | Uniform |
ldmk is uniformly distributed | Uniform |
id has unique values | Unique |
Reproduction
| Analysis started | 2023-04-28 21:31:57.474986 |
|---|---|
| Analysis finished | 2023-04-28 21:32:28.554789 |
| Duration | 31.08 seconds |
| Software version | pandas-profiling v3.6.6 |
| Download configuration | config.json |
Unnamed: 0
Real number (ℝ)
| Distinct | 187251 |
|---|---|
| Distinct (%) | 45.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 76803.876 |
| Minimum | 1 |
|---|---|
| Maximum | 187251 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.2 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 6795.15 |
| Q1 | 33974 |
| median | 67948 |
| Q3 | 117975 |
| 95-th percentile | 166866.85 |
| Maximum | 187251 |
| Range | 187250 |
| Interquartile range (IQR) | 84001 |
Descriptive statistics
| Standard deviation | 50412.731 |
|---|---|
| Coefficient of variation (CV) | 0.65638264 |
| Kurtosis | -0.97628217 |
| Mean | 76803.876 |
| Median Absolute Deviation (MAD) | 40395 |
| Skewness | 0.35472575 |
| Sum | 3.1311711 × 1010 |
| Variance | 2.5414434 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 3 | < 0.1% |
| 46542 | 3 | < 0.1% |
| 46548 | 3 | < 0.1% |
| 46547 | 3 | < 0.1% |
| 46546 | 3 | < 0.1% |
| 46545 | 3 | < 0.1% |
| 46544 | 3 | < 0.1% |
| 46543 | 3 | < 0.1% |
| 46541 | 3 | < 0.1% |
| 46635 | 3 | < 0.1% |
| Other values (187241) | 407654 |
| Value | Count | Frequency (%) |
| 1 | 3 | |
| 2 | 3 | |
| 3 | 3 | |
| 4 | 3 | |
| 5 | 3 | |
| 6 | 3 | |
| 7 | 3 | |
| 8 | 3 | |
| 9 | 3 | |
| 10 | 3 |
| Value | Count | Frequency (%) |
| 187251 | 1 | |
| 187250 | 1 | |
| 187249 | 1 | |
| 187248 | 1 | |
| 187247 | 1 | |
| 187246 | 1 | |
| 187245 | 1 | |
| 187244 | 1 | |
| 187243 | 1 | |
| 187242 | 1 |
stop_id
Real number (ℝ)
| Distinct | 353547 |
|---|---|
| Distinct (%) | 86.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 269011.35 |
| Minimum | 84362 |
|---|---|
| Maximum | 449933 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.2 MiB |
Quantile statistics
| Minimum | 84362 |
|---|---|
| 5-th percentile | 106360.15 |
| Q1 | 177796.75 |
| median | 269751.5 |
| Q3 | 359576.25 |
| 95-th percentile | 431524.85 |
| Maximum | 449933 |
| Range | 365571 |
| Interquartile range (IQR) | 181779.5 |
Descriptive statistics
| Standard deviation | 104491.5 |
|---|---|
| Coefficient of variation (CV) | 0.38842786 |
| Kurtosis | -1.1966487 |
| Mean | 269011.35 |
| Median Absolute Deviation (MAD) | 90905 |
| Skewness | -0.006532643 |
| Sum | 1.0967162 × 1011 |
| Variance | 1.0918474 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 174011 | 52 | < 0.1% |
| 184085 | 48 | < 0.1% |
| 180326 | 46 | < 0.1% |
| 169932 | 42 | < 0.1% |
| 183655 | 40 | < 0.1% |
| 161095 | 39 | < 0.1% |
| 174472 | 38 | < 0.1% |
| 236965 | 35 | < 0.1% |
| 170316 | 34 | < 0.1% |
| 169927 | 32 | < 0.1% |
| Other values (353537) | 407278 |
| Value | Count | Frequency (%) |
| 84362 | 1 | |
| 84364 | 1 | |
| 84365 | 1 | |
| 84366 | 1 | |
| 84369 | 1 | |
| 84370 | 1 | |
| 84371 | 1 | |
| 84372 | 2 | |
| 84373 | 1 | |
| 84374 | 1 |
| Value | Count | Frequency (%) |
| 449933 | 1 | < 0.1% |
| 449726 | 1 | < 0.1% |
| 449716 | 1 | < 0.1% |
| 449709 | 1 | < 0.1% |
| 449701 | 1 | < 0.1% |
| 449694 | 1 | < 0.1% |
| 449693 | 2 | |
| 449692 | 1 | < 0.1% |
| 449687 | 3 | |
| 449675 | 1 | < 0.1% |
pid
Real number (ℝ)
| Distinct | 52 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.2621442 |
| Minimum | 1 |
|---|---|
| Maximum | 52 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.2 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 52 |
| Range | 51 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.2245322 |
|---|---|
| Coefficient of variation (CV) | 0.9701999 |
| Kurtosis | 338.29907 |
| Mean | 1.2621442 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 14.708297 |
| Sum | 514556 |
| Variance | 1.4994791 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 353540 | |
| 2 | 35536 | 8.7% |
| 3 | 9722 | 2.4% |
| 4 | 3793 | 0.9% |
| 5 | 1715 | 0.4% |
| 6 | 889 | 0.2% |
| 7 | 531 | 0.1% |
| 8 | 360 | 0.1% |
| 9 | 257 | 0.1% |
| 10 | 210 | 0.1% |
| Other values (42) | 1131 | 0.3% |
| Value | Count | Frequency (%) |
| 1 | 353540 | |
| 2 | 35536 | 8.7% |
| 3 | 9722 | 2.4% |
| 4 | 3793 | 0.9% |
| 5 | 1715 | 0.4% |
| 6 | 889 | 0.2% |
| 7 | 531 | 0.1% |
| 8 | 360 | 0.1% |
| 9 | 257 | 0.1% |
| 10 | 210 | 0.1% |
| Value | Count | Frequency (%) |
| 52 | 1 | < 0.1% |
| 51 | 1 | < 0.1% |
| 50 | 1 | < 0.1% |
| 49 | 1 | < 0.1% |
| 48 | 2 | |
| 47 | 2 | |
| 46 | 3 | |
| 45 | 3 | |
| 44 | 3 | |
| 43 | 3 |
id
Categorical
HIGH CARDINALITY  UNIFORM  UNIQUE 
| Distinct | 407684 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.2 MiB |
| 84362_1 | 1 |
|---|---|
| 329041_1 | 1 |
| 329039_1 | 1 |
| 329038_1 | 1 |
| 329037_1 | 1 |
| Other values (407679) |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 7.9652083 |
| Min length | 7 |
Characters and Unicode
| Total characters | 3247288 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 407684 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 84362_1 |
|---|---|
| 2nd row | 84364_1 |
| 3rd row | 84365_1 |
| 4th row | 84366_1 |
| 5th row | 84369_1 |
Common Values
| Value | Count | Frequency (%) |
| 84362_1 | 1 | < 0.1% |
| 329041_1 | 1 | < 0.1% |
| 329039_1 | 1 | < 0.1% |
| 329038_1 | 1 | < 0.1% |
| 329037_1 | 1 | < 0.1% |
| 329036_3 | 1 | < 0.1% |
| 329036_2 | 1 | < 0.1% |
| 329036_1 | 1 | < 0.1% |
| 329035_1 | 1 | < 0.1% |
| 329034_1 | 1 | < 0.1% |
| Other values (407674) | 407674 |
Length
| Value | Count | Frequency (%) |
| 84362_1 | 1 | < 0.1% |
| 84365_1 | 1 | < 0.1% |
| 84369_1 | 1 | < 0.1% |
| 84370_1 | 1 | < 0.1% |
| 84371_1 | 1 | < 0.1% |
| 84372_1 | 1 | < 0.1% |
| 84372_2 | 1 | < 0.1% |
| 84373_1 | 1 | < 0.1% |
| 84374_1 | 1 | < 0.1% |
| 84375_1 | 1 | < 0.1% |
| Other values (407674) | 407674 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 674216 | |
| _ | 407684 | |
| 2 | 353904 | |
| 3 | 332496 | |
| 4 | 269023 | 8.3% |
| 9 | 209563 | 6.5% |
| 0 | 205642 | 6.3% |
| 8 | 203747 | 6.3% |
| 6 | 198566 | 6.1% |
| 5 | 196463 | 6.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2839604 | |
| Connector Punctuation | 407684 | 12.6% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 674216 | |
| 2 | 353904 | |
| 3 | 332496 | |
| 4 | 269023 | 9.5% |
| 9 | 209563 | 7.4% |
| 0 | 205642 | 7.2% |
| 8 | 203747 | 7.2% |
| 6 | 198566 | 7.0% |
| 5 | 196463 | 6.9% |
| 7 | 195984 | 6.9% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 407684 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3247288 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 674216 | |
| _ | 407684 | |
| 2 | 353904 | |
| 3 | 332496 | |
| 4 | 269023 | 8.3% |
| 9 | 209563 | 6.5% |
| 0 | 205642 | 6.3% |
| 8 | 203747 | 6.3% |
| 6 | 198566 | 6.1% |
| 5 | 196463 | 6.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3247288 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 674216 | |
| _ | 407684 | |
| 2 | 353904 | |
| 3 | 332496 | |
| 4 | 269023 | 8.3% |
| 9 | 209563 | 6.5% |
| 0 | 205642 | 6.3% |
| 8 | 203747 | 6.3% |
| 6 | 198566 | 6.1% |
| 5 | 196463 | 6.1% |
ori
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.2 MiB |
| CA0371100 |
|---|
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Characters and Unicode
| Total characters | 3669156 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CA0371100 |
|---|---|
| 2nd row | CA0371100 |
| 3rd row | CA0371100 |
| 4th row | CA0371100 |
| 5th row | CA0371100 |
Common Values
| Value | Count | Frequency (%) |
| CA0371100 | 407684 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| ca0371100 | 407684 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1223052 | |
| 1 | 815368 | |
| C | 407684 | 11.1% |
| A | 407684 | 11.1% |
| 3 | 407684 | 11.1% |
| 7 | 407684 | 11.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2853788 | |
| Uppercase Letter | 815368 | 22.2% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1223052 | |
| 1 | 815368 | |
| 3 | 407684 | 14.3% |
| 7 | 407684 | 14.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 407684 | |
| A | 407684 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2853788 | |
| Latin | 815368 | 22.2% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1223052 | |
| 1 | 815368 | |
| 3 | 407684 | 14.3% |
| 7 | 407684 | 14.3% |
Latin
| Value | Count | Frequency (%) |
| C | 407684 | |
| A | 407684 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3669156 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1223052 | |
| 1 | 815368 | |
| C | 407684 | 11.1% |
| A | 407684 | 11.1% |
| 3 | 407684 | 11.1% |
| 7 | 407684 | 11.1% |
agency
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.2 MiB |
| SD |
|---|
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 815368 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SD |
|---|---|
| 2nd row | SD |
| 3rd row | SD |
| 4th row | SD |
| 5th row | SD |
Common Values
| Value | Count | Frequency (%) |
| SD | 407684 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| sd | 407684 |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 407684 | |
| D | 407684 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 815368 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 407684 | |
| D | 407684 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 815368 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 407684 | |
| D | 407684 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 815368 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 407684 | |
| D | 407684 |
exp_years
Real number (ℝ)
| Distinct | 40 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.2757896 |
| Minimum | 1 |
|---|---|
| Maximum | 50 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.2 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 3 |
| Q3 | 10 |
| 95-th percentile | 21 |
| Maximum | 50 |
| Range | 49 |
| Interquartile range (IQR) | 9 |
Descriptive statistics
| Standard deviation | 7.0895988 |
|---|---|
| Coefficient of variation (CV) | 1.1296744 |
| Kurtosis | 2.1497799 |
| Mean | 6.2757896 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 1.5884159 |
| Sum | 2558539 |
| Variance | 50.262411 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 152249 | |
| 3 | 33347 | 8.2% |
| 2 | 30587 | 7.5% |
| 5 | 30179 | 7.4% |
| 4 | 24049 | 5.9% |
| 10 | 16650 | 4.1% |
| 11 | 12187 | 3.0% |
| 18 | 11768 | 2.9% |
| 9 | 10901 | 2.7% |
| 12 | 9835 | 2.4% |
| Other values (30) | 75932 |
| Value | Count | Frequency (%) |
| 1 | 152249 | |
| 2 | 30587 | 7.5% |
| 3 | 33347 | 8.2% |
| 4 | 24049 | 5.9% |
| 5 | 30179 | 7.4% |
| 6 | 9370 | 2.3% |
| 7 | 4610 | 1.1% |
| 8 | 5255 | 1.3% |
| 9 | 10901 | 2.7% |
| 10 | 16650 | 4.1% |
| Value | Count | Frequency (%) |
| 50 | 4 | < 0.1% |
| 49 | 23 | < 0.1% |
| 48 | 231 | |
| 45 | 33 | < 0.1% |
| 37 | 2 | < 0.1% |
| 35 | 1 | < 0.1% |
| 34 | 1 | < 0.1% |
| 33 | 35 | < 0.1% |
| 32 | 197 | |
| 31 | 88 | < 0.1% |
date
Categorical
| Distinct | 912 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.2 MiB |
| 2020-02-12 | 799 |
|---|---|
| 2019-05-23 | 793 |
| 2020-02-11 | 791 |
| 2019-07-06 | 755 |
| 2020-01-16 | 749 |
| Other values (907) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 4076840 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2019-01-01 |
|---|---|
| 2nd row | 2019-01-01 |
| 3rd row | 2019-01-01 |
| 4th row | 2019-01-01 |
| 5th row | 2019-01-01 |
Common Values
| Value | Count | Frequency (%) |
| 2020-02-12 | 799 | 0.2% |
| 2019-05-23 | 793 | 0.2% |
| 2020-02-11 | 791 | 0.2% |
| 2019-07-06 | 755 | 0.2% |
| 2020-01-16 | 749 | 0.2% |
| 2019-10-23 | 734 | 0.2% |
| 2019-09-24 | 733 | 0.2% |
| 2019-08-21 | 722 | 0.2% |
| 2019-10-02 | 715 | 0.2% |
| 2019-03-27 | 712 | 0.2% |
| Other values (902) | 400181 |
Length
| Value | Count | Frequency (%) |
| 2020-02-12 | 799 | 0.2% |
| 2019-05-23 | 793 | 0.2% |
| 2020-02-11 | 791 | 0.2% |
| 2019-07-06 | 755 | 0.2% |
| 2020-01-16 | 749 | 0.2% |
| 2019-10-23 | 734 | 0.2% |
| 2019-09-24 | 733 | 0.2% |
| 2019-08-21 | 722 | 0.2% |
| 2019-10-02 | 715 | 0.2% |
| 2019-03-27 | 712 | 0.2% |
| Other values (902) | 400181 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1073720 | |
| 2 | 865923 | |
| - | 815368 | |
| 1 | 591994 | |
| 9 | 254877 | 6.3% |
| 3 | 102764 | 2.5% |
| 5 | 80571 | 2.0% |
| 4 | 79204 | 1.9% |
| 6 | 74134 | 1.8% |
| 7 | 69263 | 1.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3261472 | |
| Dash Punctuation | 815368 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1073720 | |
| 2 | 865923 | |
| 1 | 591994 | |
| 9 | 254877 | 7.8% |
| 3 | 102764 | 3.2% |
| 5 | 80571 | 2.5% |
| 4 | 79204 | 2.4% |
| 6 | 74134 | 2.3% |
| 7 | 69263 | 2.1% |
| 8 | 69022 | 2.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 815368 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4076840 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1073720 | |
| 2 | 865923 | |
| - | 815368 | |
| 1 | 591994 | |
| 9 | 254877 | 6.3% |
| 3 | 102764 | 2.5% |
| 5 | 80571 | 2.0% |
| 4 | 79204 | 1.9% |
| 6 | 74134 | 1.8% |
| 7 | 69263 | 1.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4076840 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1073720 | |
| 2 | 865923 | |
| - | 815368 | |
| 1 | 591994 | |
| 9 | 254877 | 6.3% |
| 3 | 102764 | 2.5% |
| 5 | 80571 | 2.0% |
| 4 | 79204 | 1.9% |
| 6 | 74134 | 1.8% |
| 7 | 69263 | 1.7% |
time
Categorical
| Distinct | 77771 |
|---|---|
| Distinct (%) | 19.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.2 MiB |
| 16:00:00 | 1122 |
|---|---|
| 10:00:00 | 982 |
| 08:00:00 | 976 |
| 15:00:00 | 976 |
| 11:00:00 | 941 |
| Other values (77766) |
Length
| Max length | 19 |
|---|---|
| Median length | 8 |
| Mean length | 8.0024823 |
| Min length | 8 |
Characters and Unicode
| Total characters | 3262484 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 4 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 14534 ? |
|---|---|
| Unique (%) | 3.6% |
Sample
| 1st row | 00:15:07 |
|---|---|
| 2nd row | 00:15:16 |
| 3rd row | 00:02:00 |
| 4th row | 00:38:00 |
| 5th row | 01:06:41 |
Common Values
| Value | Count | Frequency (%) |
| 16:00:00 | 1122 | 0.3% |
| 10:00:00 | 982 | 0.2% |
| 08:00:00 | 976 | 0.2% |
| 15:00:00 | 976 | 0.2% |
| 11:00:00 | 941 | 0.2% |
| 09:00:00 | 936 | 0.2% |
| 22:00:00 | 914 | 0.2% |
| 17:00:00 | 900 | 0.2% |
| 15:30:00 | 817 | 0.2% |
| 07:00:00 | 800 | 0.2% |
| Other values (77761) | 398320 |
Length
| Value | Count | Frequency (%) |
| 16:00:00 | 1122 | 0.3% |
| 10:00:00 | 982 | 0.2% |
| 08:00:00 | 976 | 0.2% |
| 15:00:00 | 976 | 0.2% |
| 11:00:00 | 941 | 0.2% |
| 09:00:00 | 936 | 0.2% |
| 22:00:00 | 914 | 0.2% |
| 17:00:00 | 900 | 0.2% |
| 15:30:00 | 817 | 0.2% |
| 07:00:00 | 800 | 0.2% |
| Other values (77754) | 398412 |
Most occurring characters
| Value | Count | Frequency (%) |
| : | 815368 | |
| 0 | 690766 | |
| 1 | 418785 | |
| 2 | 291617 | 8.9% |
| 5 | 228173 | 7.0% |
| 3 | 221424 | 6.8% |
| 4 | 196101 | 6.0% |
| 8 | 104305 | 3.2% |
| 7 | 100548 | 3.1% |
| 9 | 100195 | 3.1% |
| Other values (3) | 95202 | 2.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2446840 | |
| Other Punctuation | 815368 | 25.0% |
| Dash Punctuation | 184 | < 0.1% |
| Space Separator | 92 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 690766 | |
| 1 | 418785 | |
| 2 | 291617 | |
| 5 | 228173 | 9.3% |
| 3 | 221424 | 9.0% |
| 4 | 196101 | 8.0% |
| 8 | 104305 | 4.3% |
| 7 | 100548 | 4.1% |
| 9 | 100195 | 4.1% |
| 6 | 94926 | 3.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 815368 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 184 |
Space Separator
| Value | Count | Frequency (%) |
| 92 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3262484 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| : | 815368 | |
| 0 | 690766 | |
| 1 | 418785 | |
| 2 | 291617 | 8.9% |
| 5 | 228173 | 7.0% |
| 3 | 221424 | 6.8% |
| 4 | 196101 | 6.0% |
| 8 | 104305 | 3.2% |
| 7 | 100548 | 3.1% |
| 9 | 100195 | 3.1% |
| Other values (3) | 95202 | 2.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3262484 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| : | 815368 | |
| 0 | 690766 | |
| 1 | 418785 | |
| 2 | 291617 | 8.9% |
| 5 | 228173 | 7.0% |
| 3 | 221424 | 6.8% |
| 4 | 196101 | 6.0% |
| 8 | 104305 | 3.2% |
| 7 | 100548 | 3.1% |
| 9 | 100195 | 3.1% |
| Other values (3) | 95202 | 2.9% |
dur
Real number (ℝ)
| Distinct | 337 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 28.579856 |
| Minimum | 1 |
|---|---|
| Maximum | 1440 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.2 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 10 |
| median | 15 |
| Q3 | 30 |
| 95-th percentile | 120 |
| Maximum | 1440 |
| Range | 1439 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 49.791228 |
|---|---|
| Coefficient of variation (CV) | 1.7421791 |
| Kurtosis | 182.30022 |
| Mean | 28.579856 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 9.2495682 |
| Sum | 11651550 |
| Variance | 2479.1664 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10 | 99603 | |
| 15 | 49427 | |
| 5 | 42258 | |
| 20 | 41432 | |
| 30 | 28548 | 7.0% |
| 60 | 17833 | 4.4% |
| 8 | 13319 | 3.3% |
| 120 | 12718 | 3.1% |
| 6 | 12384 | 3.0% |
| 7 | 9335 | 2.3% |
| Other values (327) | 80827 |
| Value | Count | Frequency (%) |
| 1 | 1035 | 0.3% |
| 2 | 2654 | 0.7% |
| 3 | 2755 | 0.7% |
| 4 | 2327 | 0.6% |
| 5 | 42258 | |
| 6 | 12384 | 3.0% |
| 7 | 9335 | 2.3% |
| 8 | 13319 | 3.3% |
| 9 | 4340 | 1.1% |
| 10 | 99603 |
| Value | Count | Frequency (%) |
| 1440 | 52 | |
| 1422 | 1 | < 0.1% |
| 1400 | 24 | |
| 1355 | 1 | < 0.1% |
| 1330 | 2 | < 0.1% |
| 1301 | 1 | < 0.1% |
| 1300 | 3 | < 0.1% |
| 1230 | 1 | < 0.1% |
| 1220 | 1 | < 0.1% |
| 1210 | 4 | < 0.1% |
is_serv
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.2 MiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 407684 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 363639 | |
| 1 | 44045 | 10.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 363639 | |
| 1 | 44045 | 10.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 363639 | |
| 1 | 44045 | 10.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 407684 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 363639 | |
| 1 | 44045 | 10.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 407684 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 363639 | |
| 1 | 44045 | 10.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 407684 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 363639 | |
| 1 | 44045 | 10.8% |
assign_key
Real number (ℝ)
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.4390827 |
| Minimum | 1 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.2 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 5 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.8204938 |
|---|---|
| Coefficient of variation (CV) | 1.2650377 |
| Kurtosis | 16.054601 |
| Mean | 1.4390827 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.1949149 |
| Sum | 586691 |
| Variance | 3.3141978 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 378571 | |
| 10 | 13106 | 3.2% |
| 2 | 7104 | 1.7% |
| 9 | 3700 | 0.9% |
| 5 | 1624 | 0.4% |
| 7 | 1247 | 0.3% |
| 6 | 802 | 0.2% |
| 4 | 626 | 0.2% |
| 8 | 535 | 0.1% |
| 3 | 369 | 0.1% |
| Value | Count | Frequency (%) |
| 1 | 378571 | |
| 2 | 7104 | 1.7% |
| 3 | 369 | 0.1% |
| 4 | 626 | 0.2% |
| 5 | 1624 | 0.4% |
| 6 | 802 | 0.2% |
| 7 | 1247 | 0.3% |
| 8 | 535 | 0.1% |
| 9 | 3700 | 0.9% |
| 10 | 13106 | 3.2% |
| Value | Count | Frequency (%) |
| 10 | 13106 | 3.2% |
| 9 | 3700 | 0.9% |
| 8 | 535 | 0.1% |
| 7 | 1247 | 0.3% |
| 6 | 802 | 0.2% |
| 5 | 1624 | 0.4% |
| 4 | 626 | 0.2% |
| 3 | 369 | 0.1% |
| 2 | 7104 | 1.7% |
| 1 | 378571 |
assign_words
Categorical
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.2 MiB |
| Patrol, traffic enforcement, field operations | |
|---|---|
| Other | 13106 |
| Gang enforcement | 7104 |
| Investigative/detective | 3700 |
| Roadblock or DUI sobriety checkpoint | 1624 |
| Other values (5) | 3579 |
Length
| Max length | 78 |
|---|---|
| Median length | 45 |
| Mean length | 42.774671 |
| Min length | 5 |
Characters and Unicode
| Total characters | 17438549 |
|---|---|
| Distinct characters | 39 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Patrol, traffic enforcement, field operations |
|---|---|
| 2nd row | Patrol, traffic enforcement, field operations |
| 3rd row | Patrol, traffic enforcement, field operations |
| 4th row | Patrol, traffic enforcement, field operations |
| 5th row | Patrol, traffic enforcement, field operations |
Common Values
| Value | Count | Frequency (%) |
| Patrol, traffic enforcement, field operations | 378571 | |
| Other | 13106 | 3.2% |
| Gang enforcement | 7104 | 1.7% |
| Investigative/detective | 3700 | 0.9% |
| Roadblock or DUI sobriety checkpoint | 1624 | 0.4% |
| Task force | 1247 | 0.3% |
| Narcotics/vice | 802 | 0.2% |
| Special events | 626 | 0.2% |
| K1-12 public school inlcuding school resource officer or school police officer | 535 | 0.1% |
| Compliance check | 369 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| enforcement | 385675 | |
| patrol | 378571 | |
| field | 378571 | |
| operations | 378571 | |
| traffic | 378571 | |
| other | 13106 | 0.7% |
| gang | 7104 | 0.4% |
| investigative/detective | 3700 | 0.2% |
| or | 2159 | 0.1% |
| roadblock | 1624 | 0.1% |
| Other values (17) | 15508 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1956361 | |
| t | 1553970 | |
| r | 1542466 | |
| o | 1537811 | |
| 1535476 | ||
| f | 1524775 | |
| n | 1164414 | 6.7% |
| i | 1155870 | 6.6% |
| a | 1151185 | 6.6% |
| c | 783019 | 4.5% |
| Other values (29) | 3533202 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 14726733 | |
| Space Separator | 1535476 | 8.8% |
| Other Punctuation | 761644 | 4.4% |
| Uppercase Letter | 412556 | 2.4% |
| Decimal Number | 1605 | < 0.1% |
| Dash Punctuation | 535 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1956361 | |
| t | 1553970 | |
| r | 1542466 | |
| o | 1537811 | |
| f | 1524775 | |
| n | 1164414 | |
| i | 1155870 | |
| a | 1151185 | |
| c | 783019 | |
| l | 762971 | 5.2% |
| Other values (11) | 1593891 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 378571 | |
| O | 13106 | 3.2% |
| G | 7104 | 1.7% |
| I | 5324 | 1.3% |
| R | 1624 | 0.4% |
| D | 1624 | 0.4% |
| U | 1624 | 0.4% |
| T | 1247 | 0.3% |
| N | 802 | 0.2% |
| S | 626 | 0.2% |
| Other values (2) | 904 | 0.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 757142 | |
| / | 4502 | 0.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1070 | |
| 2 | 535 |
Space Separator
| Value | Count | Frequency (%) |
| 1535476 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 535 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15139289 | |
| Common | 2299260 | 13.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1956361 | |
| t | 1553970 | |
| r | 1542466 | |
| o | 1537811 | |
| f | 1524775 | |
| n | 1164414 | |
| i | 1155870 | |
| a | 1151185 | |
| c | 783019 | 5.2% |
| l | 762971 | 5.0% |
| Other values (23) | 2006447 |
Common
| Value | Count | Frequency (%) |
| 1535476 | ||
| , | 757142 | |
| / | 4502 | 0.2% |
| 1 | 1070 | < 0.1% |
| - | 535 | < 0.1% |
| 2 | 535 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17438549 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1956361 | |
| t | 1553970 | |
| r | 1542466 | |
| o | 1537811 | |
| 1535476 | ||
| f | 1524775 | |
| n | 1164414 | 6.7% |
| i | 1155870 | 6.6% |
| a | 1151185 | 6.6% |
| c | 783019 | 4.5% |
| Other values (29) | 3533202 |
inters
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 15939 |
|---|---|
| Distinct (%) | 39.1% |
| Missing | 366868 |
| Missing (%) | 90.0% |
| Memory size | 6.2 MiB |
| BROADWAY | 278 |
|---|---|
| MIRAMAR WAY | 250 |
| CAMINO DE LA PLAZA/ CAMIONES WAY | 222 |
| OTAY VALLEY ROAD/ AVENIDA DE LAS VISTAS | 145 |
| G Street | 137 |
| Other values (15934) |
Length
| Max length | 77 |
|---|---|
| Median length | 59 |
| Mean length | 13.920178 |
| Min length | 1 |
Characters and Unicode
| Total characters | 568166 |
|---|---|
| Distinct characters | 78 |
| Distinct categories | 9 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 11202 ? |
|---|---|
| Unique (%) | 27.4% |
Sample
| 1st row | governor dr |
|---|---|
| 2nd row | la jolla village dr |
| 3rd row | mission/hornblend |
| 4th row | hornblend/mission blvd |
| 5th row | clairemont mesa blvd |
Common Values
| Value | Count | Frequency (%) |
| BROADWAY | 278 | 0.1% |
| MIRAMAR WAY | 250 | 0.1% |
| CAMINO DE LA PLAZA/ CAMIONES WAY | 222 | 0.1% |
| OTAY VALLEY ROAD/ AVENIDA DE LAS VISTAS | 145 | < 0.1% |
| G Street | 137 | < 0.1% |
| imperial | 129 | < 0.1% |
| garnet | 128 | < 0.1% |
| w ash | 127 | < 0.1% |
| MARKET ST | 108 | < 0.1% |
| I-15 | 105 | < 0.1% |
| Other values (15929) | 39187 | 9.6% |
| (Missing) | 366868 |
Length
| Value | Count | Frequency (%) |
| and | 3985 | 3.7% |
| st | 3619 | 3.4% |
| ave | 3372 | 3.2% |
| 3273 | 3.1% | |
| street | 2699 | 2.5% |
| beach | 1622 | 1.5% |
| rd | 1616 | 1.5% |
| mission | 1598 | 1.5% |
| blvd | 1541 | 1.4% |
| dr | 1353 | 1.3% |
| Other values (4709) | 81672 |
Most occurring characters
| Value | Count | Frequency (%) |
| 65653 | 11.6% | |
| a | 30420 | 5.4% |
| e | 28576 | 5.0% |
| A | 27394 | 4.8% |
| r | 22026 | 3.9% |
| E | 20689 | 3.6% |
| n | 19236 | 3.4% |
| t | 18707 | 3.3% |
| R | 17141 | 3.0% |
| o | 16782 | 3.0% |
| Other values (68) | 301542 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 253618 | |
| Uppercase Letter | 218264 | |
| Space Separator | 65653 | 11.6% |
| Decimal Number | 19086 | 3.4% |
| Other Punctuation | 9867 | 1.7% |
| Dash Punctuation | 1667 | 0.3% |
| Open Punctuation | 5 | < 0.1% |
| Close Punctuation | 5 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 30420 | |
| e | 28576 | |
| r | 22026 | 8.7% |
| n | 19236 | 7.6% |
| t | 18707 | 7.4% |
| o | 16782 | 6.6% |
| i | 16117 | 6.4% |
| s | 14379 | 5.7% |
| l | 13637 | 5.4% |
| d | 13130 | 5.2% |
| Other values (16) | 60608 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 27394 | |
| E | 20689 | 9.5% |
| R | 17141 | 7.9% |
| S | 15712 | 7.2% |
| I | 15083 | 6.9% |
| N | 13468 | 6.2% |
| O | 12558 | 5.8% |
| T | 11194 | 5.1% |
| L | 10226 | 4.7% |
| C | 9970 | 4.6% |
| Other values (16) | 64829 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 8725 | |
| . | 577 | 5.8% |
| & | 194 | 2.0% |
| @ | 181 | 1.8% |
| , | 116 | 1.2% |
| ' | 59 | 0.6% |
| ! | 5 | 0.1% |
| : | 5 | 0.1% |
| # | 3 | < 0.1% |
| % | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 5996 | |
| 1 | 3466 | |
| 8 | 2088 | 10.9% |
| 0 | 1848 | 9.7% |
| 6 | 1280 | 6.7% |
| 4 | 1200 | 6.3% |
| 3 | 1120 | 5.9% |
| 2 | 971 | 5.1% |
| 9 | 627 | 3.3% |
| 7 | 490 | 2.6% |
Space Separator
| Value | Count | Frequency (%) |
| 65653 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1667 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 5 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 5 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 471882 | |
| Common | 96284 | 16.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 30420 | 6.4% |
| e | 28576 | 6.1% |
| A | 27394 | 5.8% |
| r | 22026 | 4.7% |
| E | 20689 | 4.4% |
| n | 19236 | 4.1% |
| t | 18707 | 4.0% |
| R | 17141 | 3.6% |
| o | 16782 | 3.6% |
| i | 16117 | 3.4% |
| Other values (42) | 254794 |
Common
| Value | Count | Frequency (%) |
| 65653 | ||
| / | 8725 | 9.1% |
| 5 | 5996 | 6.2% |
| 1 | 3466 | 3.6% |
| 8 | 2088 | 2.2% |
| 0 | 1848 | 1.9% |
| - | 1667 | 1.7% |
| 6 | 1280 | 1.3% |
| 4 | 1200 | 1.2% |
| 3 | 1120 | 1.2% |
| Other values (16) | 3241 | 3.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 568166 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 65653 | 11.6% | |
| a | 30420 | 5.4% |
| e | 28576 | 5.0% |
| A | 27394 | 4.8% |
| r | 22026 | 3.9% |
| E | 20689 | 3.6% |
| n | 19236 | 3.4% |
| t | 18707 | 3.3% |
| R | 17141 | 3.0% |
| o | 16782 | 3.0% |
| Other values (68) | 301542 |
block
Real number (ℝ)
MISSING  SKEWED 
| Distinct | 307 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 43330 |
| Missing (%) | 10.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7028.819 |
| Minimum | 0 |
|---|---|
| Maximum | 99999900 |
| Zeros | 133 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 200 |
| Q1 | 1300 |
| median | 3200 |
| Q3 | 4800 |
| 95-th percentile | 9600 |
| Maximum | 99999900 |
| Range | 99999900 |
| Interquartile range (IQR) | 3500 |
Descriptive statistics
| Standard deviation | 321105.19 |
|---|---|
| Coefficient of variation (CV) | 45.684089 |
| Kurtosis | 77631.695 |
| Mean | 7028.819 |
| Median Absolute Deviation (MAD) | 1800 |
| Skewness | 254.9437 |
| Sum | 2.5609783 × 109 |
| Variance | 1.0310854 × 1011 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 100 | 11707 | 2.9% |
| 700 | 9976 | 2.4% |
| 3000 | 8781 | 2.2% |
| 4000 | 8565 | 2.1% |
| 1000 | 8233 | 2.0% |
| 500 | 8060 | 2.0% |
| 800 | 7831 | 1.9% |
| 4200 | 7434 | 1.8% |
| 4300 | 7320 | 1.8% |
| 3800 | 7149 | 1.8% |
| Other values (297) | 279298 | |
| (Missing) | 43330 | 10.6% |
| Value | Count | Frequency (%) |
| 0 | 133 | < 0.1% |
| 100 | 11707 | |
| 200 | 7056 | |
| 300 | 6640 | |
| 400 | 5851 | |
| 500 | 8060 | |
| 600 | 6673 | |
| 700 | 9976 | |
| 800 | 7831 | |
| 900 | 6361 |
| Value | Count | Frequency (%) |
| 99999900 | 3 | < 0.1% |
| 18007300 | 1 | < 0.1% |
| 9999900 | 70 | < 0.1% |
| 5600900 | 1 | < 0.1% |
| 999900 | 221 | |
| 520000 | 1 | < 0.1% |
| 180000 | 1 | < 0.1% |
| 154000 | 1 | < 0.1% |
| 147000 | 1 | < 0.1% |
| 140000 | 1 | < 0.1% |
ldmk
Categorical
MISSING  UNIFORM 
| Distinct | 36 |
|---|---|
| Distinct (%) | 87.8% |
| Missing | 407643 |
| Missing (%) | > 99.9% |
| Memory size | 6.2 MiB |
| 15nb exit | |
|---|---|
| North Cove Park Pacific beach | 2 |
| i805/43rd St | 2 |
| BALBOA PARK - SPANISH VILLAGE | 1 |
| NB I-15 AT AERO DRIVE | 1 |
| Other values (31) |
Length
| Max length | 41 |
|---|---|
| Median length | 29 |
| Mean length | 19.390244 |
| Min length | 8 |
Characters and Unicode
| Total characters | 795 |
|---|---|
| Distinct characters | 58 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 33 ? |
|---|---|
| Unique (%) | 80.5% |
Sample
| 1st row | sr905 / i805 |
|---|---|
| 2nd row | North Cove Park Pacific beach |
| 3rd row | North Cove Park Pacific beach |
| 4th row | I15 / I8 |
| 5th row | ON TROLLEY IN SANTEE |
Common Values
| Value | Count | Frequency (%) |
| 15nb exit | 4 | < 0.1% |
| North Cove Park Pacific beach | 2 | < 0.1% |
| i805/43rd St | 2 | < 0.1% |
| BALBOA PARK - SPANISH VILLAGE | 1 | < 0.1% |
| NB I-15 AT AERO DRIVE | 1 | < 0.1% |
| NORTHBOUND INTERSTATE-15/AERO DRIVE | 1 | < 0.1% |
| North Cove Public Beach | 1 | < 0.1% |
| Convention Center | 1 | < 0.1% |
| de anza cove | 1 | < 0.1% |
| NB I-15 @ BALBOA AVE | 1 | < 0.1% |
| Other values (26) | 26 | < 0.1% |
| (Missing) | 407643 |
Length
| Value | Count | Frequency (%) |
| at | 9 | 5.9% |
| 8 | 5.3% | |
| sb | 6 | 3.9% |
| park | 6 | 3.9% |
| balboa | 5 | 3.3% |
| and | 5 | 3.3% |
| i-15 | 5 | 3.3% |
| 15nb | 4 | 2.6% |
| nb | 4 | 2.6% |
| exit | 4 | 2.6% |
| Other values (66) | 96 |
Most occurring characters
| Value | Count | Frequency (%) |
| 111 | 14.0% | |
| A | 42 | 5.3% |
| E | 40 | 5.0% |
| T | 32 | 4.0% |
| R | 30 | 3.8% |
| B | 29 | 3.6% |
| a | 29 | 3.6% |
| I | 27 | 3.4% |
| S | 26 | 3.3% |
| N | 25 | 3.1% |
| Other values (48) | 404 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 361 | |
| Lowercase Letter | 218 | |
| Space Separator | 111 | 14.0% |
| Decimal Number | 80 | 10.1% |
| Dash Punctuation | 13 | 1.6% |
| Other Punctuation | 12 | 1.5% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 42 | |
| E | 40 | |
| T | 32 | |
| R | 30 | |
| B | 29 | 8.0% |
| I | 27 | 7.5% |
| S | 26 | 7.2% |
| N | 25 | 6.9% |
| O | 22 | 6.1% |
| D | 14 | 3.9% |
| Other values (12) | 74 |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 29 | |
| e | 22 | |
| n | 18 | 8.3% |
| t | 18 | 8.3% |
| o | 17 | 7.8% |
| r | 17 | 7.8% |
| i | 14 | 6.4% |
| b | 12 | 5.5% |
| c | 11 | 5.0% |
| d | 9 | 4.1% |
| Other values (12) | 51 |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 25 | |
| 1 | 16 | |
| 0 | 8 | 10.0% |
| 8 | 7 | 8.8% |
| 4 | 6 | 7.5% |
| 9 | 6 | 7.5% |
| 3 | 5 | 6.2% |
| 6 | 4 | 5.0% |
| 2 | 2 | 2.5% |
| 7 | 1 | 1.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 7 | |
| @ | 5 |
Space Separator
| Value | Count | Frequency (%) |
| 111 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 13 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 579 | |
| Common | 216 | 27.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 42 | 7.3% |
| E | 40 | 6.9% |
| T | 32 | 5.5% |
| R | 30 | 5.2% |
| B | 29 | 5.0% |
| a | 29 | 5.0% |
| I | 27 | 4.7% |
| S | 26 | 4.5% |
| N | 25 | 4.3% |
| O | 22 | 3.8% |
| Other values (34) | 277 |
Common
| Value | Count | Frequency (%) |
| 111 | ||
| 5 | 25 | 11.6% |
| 1 | 16 | 7.4% |
| - | 13 | 6.0% |
| 0 | 8 | 3.7% |
| / | 7 | 3.2% |
| 8 | 7 | 3.2% |
| 4 | 6 | 2.8% |
| 9 | 6 | 2.8% |
| 3 | 5 | 2.3% |
| Other values (4) | 12 | 5.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 795 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 111 | 14.0% | |
| A | 42 | 5.3% |
| E | 40 | 5.0% |
| T | 32 | 4.0% |
| R | 30 | 3.8% |
| B | 29 | 3.6% |
| a | 29 | 3.6% |
| I | 27 | 3.4% |
| S | 26 | 3.3% |
| N | 25 | 3.1% |
| Other values (48) | 404 |
street
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 44668 |
|---|---|
| Distinct (%) | 11.4% |
| Missing | 16834 |
| Missing (%) | 4.1% |
| Memory size | 6.2 MiB |
| El Cajon Blvd | 2488 |
|---|---|
| el cajon blvd | 1577 |
| imperial ave | 1551 |
| imperial | 1469 |
| garnet | 1367 |
| Other values (44663) |
Length
| Max length | 43 |
|---|---|
| Median length | 36 |
| Mean length | 10.67244 |
| Min length | 1 |
Characters and Unicode
| Total characters | 4171323 |
|---|---|
| Distinct characters | 82 |
| Distinct categories | 11 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 24136 ? |
|---|---|
| Unique (%) | 6.2% |
Sample
| 1st row | UNIVERSITY |
|---|---|
| 2nd row | hillside dr |
| 3rd row | ocean blvd |
| 4th row | garnet |
| 5th row | coronado |
Common Values
| Value | Count | Frequency (%) |
| El Cajon Blvd | 2488 | 0.6% |
| el cajon blvd | 1577 | 0.4% |
| imperial ave | 1551 | 0.4% |
| imperial | 1469 | 0.4% |
| garnet | 1367 | 0.3% |
| university ave | 1270 | 0.3% |
| University Ave | 1240 | 0.3% |
| university | 1221 | 0.3% |
| EL CAJON BLVD | 1123 | 0.3% |
| commercial | 1047 | 0.3% |
| Other values (44658) | 376497 | |
| (Missing) | 16834 | 4.1% |
Length
| Value | Count | Frequency (%) |
| ave | 47927 | 6.1% |
| st | 39901 | 5.1% |
| street | 37969 | 4.9% |
| blvd | 26679 | 3.4% |
| avenue | 17957 | 2.3% |
| rd | 15152 | 1.9% |
| dr | 12807 | 1.6% |
| mission | 11774 | 1.5% |
| road | 10459 | 1.3% |
| el | 9610 | 1.2% |
| Other values (10930) | 549297 |
Most occurring characters
| Value | Count | Frequency (%) |
| 389073 | 9.3% | |
| e | 299210 | 7.2% |
| a | 257248 | 6.2% |
| t | 210603 | 5.0% |
| r | 205794 | 4.9% |
| n | 163322 | 3.9% |
| A | 155793 | 3.7% |
| o | 151343 | 3.6% |
| i | 143971 | 3.5% |
| l | 135009 | 3.2% |
| Other values (72) | 2059957 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2348150 | |
| Uppercase Letter | 1323188 | |
| Space Separator | 389073 | 9.3% |
| Decimal Number | 99771 | 2.4% |
| Other Punctuation | 9179 | 0.2% |
| Dash Punctuation | 1695 | < 0.1% |
| Open Punctuation | 110 | < 0.1% |
| Close Punctuation | 108 | < 0.1% |
| Modifier Symbol | 46 | < 0.1% |
| Math Symbol | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 299210 | |
| a | 257248 | |
| t | 210603 | 9.0% |
| r | 205794 | 8.8% |
| n | 163322 | 7.0% |
| o | 151343 | 6.4% |
| i | 143971 | 6.1% |
| l | 135009 | 5.7% |
| s | 132042 | 5.6% |
| v | 100060 | 4.3% |
| Other values (16) | 549548 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 155793 | |
| E | 128833 | 9.7% |
| R | 111686 | 8.4% |
| S | 109975 | 8.3% |
| T | 83749 | 6.3% |
| N | 75814 | 5.7% |
| I | 72833 | 5.5% |
| O | 69409 | 5.2% |
| L | 68521 | 5.2% |
| D | 62917 | 4.8% |
| Other values (16) | 383658 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 7202 | |
| / | 1209 | 13.2% |
| & | 330 | 3.6% |
| # | 182 | 2.0% |
| @ | 95 | 1.0% |
| ' | 70 | 0.8% |
| , | 54 | 0.6% |
| : | 22 | 0.2% |
| ; | 11 | 0.1% |
| \ | 2 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 19805 | |
| 5 | 15134 | |
| 4 | 14014 | |
| 3 | 10710 | |
| 0 | 9419 | |
| 6 | 9332 | |
| 7 | 6912 | 6.9% |
| 2 | 6662 | 6.7% |
| 8 | 4663 | 4.7% |
| 9 | 3120 | 3.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 108 | |
| [ | 2 | 1.8% |
Space Separator
| Value | Count | Frequency (%) |
| 389073 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1695 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 108 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 46 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 2 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3671338 | |
| Common | 499985 | 12.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 299210 | 8.1% |
| a | 257248 | 7.0% |
| t | 210603 | 5.7% |
| r | 205794 | 5.6% |
| n | 163322 | 4.4% |
| A | 155793 | 4.2% |
| o | 151343 | 4.1% |
| i | 143971 | 3.9% |
| l | 135009 | 3.7% |
| s | 132042 | 3.6% |
| Other values (42) | 1817003 |
Common
| Value | Count | Frequency (%) |
| 389073 | ||
| 1 | 19805 | 4.0% |
| 5 | 15134 | 3.0% |
| 4 | 14014 | 2.8% |
| 3 | 10710 | 2.1% |
| 0 | 9419 | 1.9% |
| 6 | 9332 | 1.9% |
| . | 7202 | 1.4% |
| 7 | 6912 | 1.4% |
| 2 | 6662 | 1.3% |
| Other values (20) | 11722 | 2.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4171323 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 389073 | 9.3% | |
| e | 299210 | 7.2% |
| a | 257248 | 6.2% |
| t | 210603 | 5.0% |
| r | 205794 | 4.9% |
| n | 163322 | 3.9% |
| A | 155793 | 3.7% |
| o | 151343 | 3.6% |
| i | 143971 | 3.5% |
| l | 135009 | 3.2% |
| Other values (72) | 2059957 |
hw_exit
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 2211 |
|---|---|
| Distinct (%) | 72.1% |
| Missing | 404618 |
| Missing (%) | 99.2% |
| Memory size | 6.2 MiB |
| NB I-15 | 36 |
|---|---|
| I-805/PLAZA BOULEVARD | 31 |
| I-805/SR-54 | 29 |
| SR 905 | 28 |
| SB I-15 | 26 |
| Other values (2206) |
Length
| Max length | 60 |
|---|---|
| Median length | 44 |
| Mean length | 18.403783 |
| Min length | 2 |
Characters and Unicode
| Total characters | 56426 |
|---|---|
| Distinct characters | 74 |
| Distinct categories | 10 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1907 ? |
|---|---|
| Unique (%) | 62.2% |
Sample
| 1st row | n/b 5 @ sea world |
|---|---|
| 2nd row | I15NB @ AERO DR |
| 3rd row | wB 8 @ WARING |
| 4th row | 15 AT MIRAMAR |
| 5th row | 15 AT 163 |
Common Values
| Value | Count | Frequency (%) |
| NB I-15 | 36 | < 0.1% |
| I-805/PLAZA BOULEVARD | 31 | < 0.1% |
| I-805/SR-54 | 29 | < 0.1% |
| SR 905 | 28 | < 0.1% |
| SB I-15 | 26 | < 0.1% |
| I-805/43RD STREET | 23 | < 0.1% |
| I-805/H STREET | 19 | < 0.1% |
| NB 805 AT SR-163 | 18 | < 0.1% |
| NB 805 AT MURRAY RIDGE ROAD | 14 | < 0.1% |
| I-5/VIA DE SAN YSIDRO | 14 | < 0.1% |
| Other values (2201) | 2828 | 0.7% |
| (Missing) | 404618 |
Length
| Value | Count | Frequency (%) |
| at | 960 | 8.2% |
| 15 | 588 | 5.0% |
| sb | 515 | 4.4% |
| 500 | 4.3% | |
| nb | 440 | 3.8% |
| 805 | 300 | 2.6% |
| i-15 | 266 | 2.3% |
| street | 229 | 2.0% |
| and | 206 | 1.8% |
| road | 185 | 1.6% |
| Other values (803) | 7487 |
Most occurring characters
| Value | Count | Frequency (%) |
| 8622 | 15.3% | |
| 5 | 2773 | 4.9% |
| A | 2373 | 4.2% |
| E | 2281 | 4.0% |
| T | 2227 | 3.9% |
| a | 2185 | 3.9% |
| R | 2070 | 3.7% |
| I | 1803 | 3.2% |
| S | 1730 | 3.1% |
| N | 1479 | 2.6% |
| Other values (64) | 28883 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 21601 | |
| Lowercase Letter | 15708 | |
| Space Separator | 8622 | 15.3% |
| Decimal Number | 7684 | 13.6% |
| Other Punctuation | 1495 | 2.6% |
| Dash Punctuation | 1304 | 2.3% |
| Open Punctuation | 5 | < 0.1% |
| Close Punctuation | 5 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
| Connector Punctuation | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 2373 | |
| E | 2281 | |
| T | 2227 | |
| R | 2070 | |
| I | 1803 | |
| S | 1730 | |
| N | 1479 | 6.8% |
| O | 1256 | 5.8% |
| B | 1235 | 5.7% |
| D | 823 | 3.8% |
| Other values (16) | 4324 |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2185 | |
| r | 1442 | 9.2% |
| t | 1414 | 9.0% |
| e | 1414 | 9.0% |
| n | 1144 | 7.3% |
| s | 1065 | 6.8% |
| o | 1004 | 6.4% |
| b | 963 | 6.1% |
| i | 759 | 4.8% |
| l | 658 | 4.2% |
| Other values (16) | 3660 |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 2773 | |
| 1 | 1436 | |
| 8 | 1091 | 14.2% |
| 0 | 834 | 10.9% |
| 6 | 401 | 5.2% |
| 4 | 298 | 3.9% |
| 3 | 287 | 3.7% |
| 9 | 277 | 3.6% |
| 2 | 186 | 2.4% |
| 7 | 101 | 1.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 1091 | |
| @ | 344 | 23.0% |
| , | 29 | 1.9% |
| . | 16 | 1.1% |
| & | 14 | 0.9% |
| ! | 1 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 8622 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1304 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 5 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 5 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 1 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 37309 | |
| Common | 19117 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 2373 | 6.4% |
| E | 2281 | 6.1% |
| T | 2227 | 6.0% |
| a | 2185 | 5.9% |
| R | 2070 | 5.5% |
| I | 1803 | 4.8% |
| S | 1730 | 4.6% |
| N | 1479 | 4.0% |
| r | 1442 | 3.9% |
| t | 1414 | 3.8% |
| Other values (42) | 18305 |
Common
| Value | Count | Frequency (%) |
| 8622 | ||
| 5 | 2773 | 14.5% |
| 1 | 1436 | 7.5% |
| - | 1304 | 6.8% |
| 8 | 1091 | 5.7% |
| / | 1091 | 5.7% |
| 0 | 834 | 4.4% |
| 6 | 401 | 2.1% |
| @ | 344 | 1.8% |
| 4 | 298 | 1.6% |
| Other values (12) | 923 | 4.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 56426 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 8622 | 15.3% | |
| 5 | 2773 | 4.9% |
| A | 2373 | 4.2% |
| E | 2281 | 4.0% |
| T | 2227 | 3.9% |
| a | 2185 | 3.9% |
| R | 2070 | 3.7% |
| I | 1803 | 3.2% |
| S | 1730 | 3.1% |
| N | 1479 | 2.6% |
| Other values (64) | 28883 |
is_school
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.2 MiB |
| 0 | |
|---|---|
| 1 | 322 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 407684 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 407362 | |
| 1 | 322 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 407362 | |
| 1 | 322 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 407362 | |
| 1 | 322 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 407684 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 407362 | |
| 1 | 322 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 407684 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 407362 | |
| 1 | 322 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 407684 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 407362 | |
| 1 | 322 | 0.1% |
school_name
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 85 |
|---|---|
| Distinct (%) | 26.4% |
| Missing | 407362 |
| Missing (%) | 99.9% |
| Memory size | 6.2 MiB |
| Ibarra Elementary (San Diego Unified) 37683380108290 | |
|---|---|
| Rancho Bernardo High (Poway Unified) 37682963730819 | 16 |
| Del Norte High (Poway Unified) 37682960118935 | 15 |
| Serra High (San Diego Unified) 37683383730173 | 13 |
| The O'Farrell Charter (San Diego Unified) 37683386061964 | 13 |
| Other values (80) |
Length
| Max length | 69 |
|---|---|
| Median length | 66 |
| Mean length | 52.751553 |
| Min length | 34 |
Characters and Unicode
| Total characters | 16986 |
|---|---|
| Distinct characters | 65 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 32 ? |
|---|---|
| Unique (%) | 9.9% |
Sample
| 1st row | Garfield Elementary (San Diego Unified) 37683386039655 |
|---|---|
| 2nd row | Garfield Elementary (San Diego Unified) 37683386039655 |
| 3rd row | Garfield Elementary (San Diego Unified) 37683386039655 |
| 4th row | Garfield Elementary (San Diego Unified) 37683386039655 |
| 5th row | Grant K-8 (San Diego Unified) 37683386039671 |
Common Values
| Value | Count | Frequency (%) |
| Ibarra Elementary (San Diego Unified) 37683380108290 | 27 | < 0.1% |
| Rancho Bernardo High (Poway Unified) 37682963730819 | 16 | < 0.1% |
| Del Norte High (Poway Unified) 37682960118935 | 15 | < 0.1% |
| Serra High (San Diego Unified) 37683383730173 | 13 | < 0.1% |
| The O'Farrell Charter (San Diego Unified) 37683386061964 | 13 | < 0.1% |
| De Portola Middle (San Diego Unified) 37683386106181 | 13 | < 0.1% |
| Montgomery Senior High (Sweetwater Union High) 37684113738234 | 13 | < 0.1% |
| Torrey Pines High (San Dieguito Union High) 37683463730033 | 11 | < 0.1% |
| Sunset Elementary (San Ysidro Elementary) 37683796093264 | 11 | < 0.1% |
| San Ysidro High (Sweetwater Union High) 37684113731502 | 9 | < 0.1% |
| Other values (75) | 181 | < 0.1% |
| (Missing) | 407362 |
Length
| Value | Count | Frequency (%) |
| san | 222 | 10.8% |
| unified | 221 | 10.7% |
| high | 165 | 8.0% |
| diego | 149 | 7.2% |
| elementary | 144 | 7.0% |
| poway | 72 | 3.5% |
| union | 62 | 3.0% |
| middle | 50 | 2.4% |
| ysidro | 48 | 2.3% |
| sweetwater | 31 | 1.5% |
| Other values (200) | 896 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1738 | 10.2% | |
| e | 1193 | 7.0% |
| i | 1088 | 6.4% |
| 3 | 1054 | 6.2% |
| n | 915 | 5.4% |
| a | 813 | 4.8% |
| 6 | 688 | 4.1% |
| 8 | 636 | 3.7% |
| r | 616 | 3.6% |
| o | 589 | 3.5% |
| Other values (55) | 7656 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8285 | |
| Decimal Number | 4515 | |
| Uppercase Letter | 1774 | 10.4% |
| Space Separator | 1738 | 10.2% |
| Close Punctuation | 322 | 1.9% |
| Open Punctuation | 322 | 1.9% |
| Other Punctuation | 23 | 0.1% |
| Dash Punctuation | 7 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1193 | |
| i | 1088 | |
| n | 915 | |
| a | 813 | |
| r | 616 | |
| o | 589 | 7.1% |
| d | 431 | 5.2% |
| t | 402 | 4.9% |
| g | 375 | 4.5% |
| l | 345 | 4.2% |
| Other values (14) | 1518 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 331 | |
| U | 285 | |
| D | 214 | |
| H | 179 | |
| E | 149 | |
| P | 113 | 6.4% |
| M | 102 | 5.7% |
| C | 90 | 5.1% |
| Y | 48 | 2.7% |
| B | 46 | 2.6% |
| Other values (14) | 217 |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 1054 | |
| 6 | 688 | |
| 8 | 636 | |
| 7 | 542 | |
| 0 | 467 | |
| 1 | 377 | 8.3% |
| 9 | 280 | 6.2% |
| 2 | 206 | 4.6% |
| 4 | 147 | 3.3% |
| 5 | 118 | 2.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 13 | |
| . | 9 | |
| / | 1 | 4.3% |
Space Separator
| Value | Count | Frequency (%) |
| 1738 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 322 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 322 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 7 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10059 | |
| Common | 6927 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1193 | 11.9% |
| i | 1088 | 10.8% |
| n | 915 | 9.1% |
| a | 813 | 8.1% |
| r | 616 | 6.1% |
| o | 589 | 5.9% |
| d | 431 | 4.3% |
| t | 402 | 4.0% |
| g | 375 | 3.7% |
| l | 345 | 3.4% |
| Other values (38) | 3292 |
Common
| Value | Count | Frequency (%) |
| 1738 | ||
| 3 | 1054 | |
| 6 | 688 | 9.9% |
| 8 | 636 | 9.2% |
| 7 | 542 | 7.8% |
| 0 | 467 | 6.7% |
| 1 | 377 | 5.4% |
| ) | 322 | 4.6% |
| ( | 322 | 4.6% |
| 9 | 280 | 4.0% |
| Other values (7) | 501 | 7.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16986 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1738 | 10.2% | |
| e | 1193 | 7.0% |
| i | 1088 | 6.4% |
| 3 | 1054 | 6.2% |
| n | 915 | 5.4% |
| a | 813 | 4.8% |
| 6 | 688 | 4.1% |
| 8 | 636 | 3.7% |
| r | 616 | 3.6% |
| o | 589 | 3.5% |
| Other values (55) | 7656 |
city
Categorical
| Distinct | 46 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.2 MiB |
| SAN DIEGO | |
|---|---|
| SAN YSIDRO | 2072 |
| CHULA VISTA | 1033 |
| NATIONAL CITY | 783 |
| EL CAJON | 415 |
| Other values (41) | 2528 |
Length
| Max length | 36 |
|---|---|
| Median length | 9 |
| Mean length | 9.0178766 |
| Min length | 4 |
Characters and Unicode
| Total characters | 3676444 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 9 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | SAN DIEGO |
|---|---|
| 2nd row | LA JOLLA |
| 3rd row | SAN DIEGO |
| 4th row | SAN DIEGO |
| 5th row | SAN DIEGO |
Common Values
| Value | Count | Frequency (%) |
| SAN DIEGO | 400853 | |
| SAN YSIDRO | 2072 | 0.5% |
| CHULA VISTA | 1033 | 0.3% |
| NATIONAL CITY | 783 | 0.2% |
| EL CAJON | 415 | 0.1% |
| ESCONDIDO | 398 | 0.1% |
| LA MESA | 310 | 0.1% |
| LA JOLLA | 287 | 0.1% |
| LEMON GROVE | 282 | 0.1% |
| SANTEE | 209 | 0.1% |
| Other values (36) | 1042 | 0.3% |
Length
| Value | Count | Frequency (%) |
| san | 403140 | |
| diego | 400856 | |
| ysidro | 2072 | 0.3% |
| vista | 1034 | 0.1% |
| chula | 1033 | 0.1% |
| national | 783 | 0.1% |
| city | 783 | 0.1% |
| la | 597 | 0.1% |
| el | 415 | 0.1% |
| cajon | 415 | 0.1% |
| Other values (49) | 3319 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 409647 | |
| S | 407720 | |
| 406763 | ||
| I | 406683 | |
| O | 406433 | |
| N | 406432 | |
| D | 403934 | |
| E | 403924 | |
| G | 401346 | |
| L | 4731 | 0.1% |
| Other values (15) | 18831 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 3269678 | |
| Space Separator | 406763 | 11.1% |
| Dash Punctuation | 3 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 409647 | |
| S | 407720 | |
| I | 406683 | |
| O | 406433 | |
| N | 406432 | |
| D | 403934 | |
| E | 403924 | |
| G | 401346 | |
| L | 4731 | 0.1% |
| Y | 3255 | 0.1% |
| Other values (13) | 15573 | 0.5% |
Space Separator
| Value | Count | Frequency (%) |
| 406763 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3269678 | |
| Common | 406766 | 11.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 409647 | |
| S | 407720 | |
| I | 406683 | |
| O | 406433 | |
| N | 406432 | |
| D | 403934 | |
| E | 403924 | |
| G | 401346 | |
| L | 4731 | 0.1% |
| Y | 3255 | 0.1% |
| Other values (13) | 15573 | 0.5% |
Common
| Value | Count | Frequency (%) |
| 406763 | ||
| - | 3 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3676444 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 409647 | |
| S | 407720 | |
| 406763 | ||
| I | 406683 | |
| O | 406433 | |
| N | 406432 | |
| D | 403934 | |
| E | 403924 | |
| G | 401346 | |
| L | 4731 | 0.1% |
| Other values (15) | 18831 | 0.5% |
beat
Real number (ℝ)
| Distinct | 126 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 511.45697 |
| Minimum | 111 |
|---|---|
| Maximum | 999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.2 MiB |
Quantile statistics
| Minimum | 111 |
|---|---|
| 5-th percentile | 121 |
| Q1 | 315 |
| median | 521 |
| Q3 | 628 |
| 95-th percentile | 931 |
| Maximum | 999 |
| Range | 888 |
| Interquartile range (IQR) | 313 |
Descriptive statistics
| Standard deviation | 242.57677 |
|---|---|
| Coefficient of variation (CV) | 0.47428578 |
| Kurtosis | -0.77389507 |
| Mean | 511.45697 |
| Median Absolute Deviation (MAD) | 194 |
| Skewness | -0.070033741 |
| Sum | 2.0851282 × 108 |
| Variance | 58843.489 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 521 | 29982 | 7.4% |
| 122 | 24592 | 6.0% |
| 611 | 16419 | 4.0% |
| 614 | 10569 | 2.6% |
| 524 | 10192 | 2.5% |
| 712 | 9851 | 2.4% |
| 512 | 9651 | 2.4% |
| 813 | 9582 | 2.4% |
| 999 | 9483 | 2.3% |
| 121 | 8744 | 2.1% |
| Other values (116) | 268619 |
| Value | Count | Frequency (%) |
| 111 | 4467 | 1.1% |
| 112 | 1542 | 0.4% |
| 113 | 1514 | 0.4% |
| 114 | 3534 | 0.9% |
| 115 | 4463 | 1.1% |
| 116 | 3624 | 0.9% |
| 121 | 8744 | 2.1% |
| 122 | 24592 | |
| 123 | 4779 | 1.2% |
| 124 | 4479 | 1.1% |
| Value | Count | Frequency (%) |
| 999 | 9483 | |
| 937 | 1047 | 0.3% |
| 936 | 393 | 0.1% |
| 935 | 691 | 0.2% |
| 934 | 5119 | |
| 933 | 1784 | 0.4% |
| 932 | 481 | 0.1% |
| 931 | 4407 | |
| 841 | 715 | 0.2% |
| 839 | 1133 | 0.3% |
beat_name
Categorical
| Distinct | 127 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.2 MiB |
| East Village 521 | 29982 |
|---|---|
| Pacific Beach 122 | 24592 |
| Midway District 611 | 16419 |
| Ocean Beach 614 | 10569 |
| Core-Columbia 524 | 10192 |
| Other values (122) |
Length
| Max length | 25 |
|---|---|
| Median length | 22 |
| Mean length | 15.909476 |
| Min length | 10 |
Characters and Unicode
| Total characters | 6486039 |
|---|---|
| Distinct characters | 64 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Cherokee Point 839 |
|---|---|
| 2nd row | La Jolla 124 |
| 3rd row | Pacific Beach 122 |
| 4th row | Pacific Beach 122 |
| 5th row | Ocean Beach 614 |
Common Values
| Value | Count | Frequency (%) |
| East Village 521 | 29982 | 7.4% |
| Pacific Beach 122 | 24592 | 6.0% |
| Midway District 611 | 16419 | 4.0% |
| Ocean Beach 614 | 10569 | 2.6% |
| Core-Columbia 524 | 10192 | 2.5% |
| San Ysidro 712 | 9851 | 2.4% |
| Logan Heights 512 | 9651 | 2.4% |
| North Park 813 | 9582 | 2.4% |
| Unknown 999 | 9411 | 2.3% |
| Mission Beach 121 | 8744 | 2.1% |
| Other values (117) | 268691 |
Length
| Value | Count | Frequency (%) |
| east | 48357 | 4.1% |
| beach | 43905 | 3.7% |
| park | 41321 | 3.5% |
| mesa | 35028 | 3.0% |
| village | 32954 | 2.8% |
| 521 | 29982 | 2.5% |
| mission | 26001 | 2.2% |
| pacific | 24592 | 2.1% |
| 122 | 24592 | 2.1% |
| heights | 21711 | 1.8% |
| Other values (263) | 851684 |
Most occurring characters
| Value | Count | Frequency (%) |
| 772443 | 11.9% | |
| a | 565930 | 8.7% |
| e | 377516 | 5.8% |
| i | 375627 | 5.8% |
| 1 | 309397 | 4.8% |
| l | 292727 | 4.5% |
| r | 273757 | 4.2% |
| 2 | 272499 | 4.2% |
| s | 268438 | 4.1% |
| t | 259143 | 4.0% |
| Other values (54) | 2718562 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3682859 | |
| Decimal Number | 1223052 | 18.9% |
| Uppercase Letter | 789444 | 12.2% |
| Space Separator | 772443 | 11.9% |
| Dash Punctuation | 10192 | 0.2% |
| Other Punctuation | 8049 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 565930 | |
| e | 377516 | |
| i | 375627 | |
| l | 292727 | |
| r | 273757 | |
| s | 268438 | |
| t | 259143 | |
| o | 251522 | |
| n | 233816 | 6.3% |
| c | 157915 | 4.3% |
| Other values (16) | 626468 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 109569 | |
| P | 80851 | |
| B | 75772 | |
| C | 72612 | |
| V | 69291 | |
| E | 55874 | 7.1% |
| H | 46169 | 5.8% |
| S | 41954 | 5.3% |
| L | 40574 | 5.1% |
| O | 27049 | 3.4% |
| Other values (14) | 169729 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 309397 | |
| 2 | 272499 | |
| 3 | 153382 | |
| 4 | 127141 | |
| 5 | 123115 | 10.1% |
| 6 | 83727 | 6.8% |
| 8 | 58189 | 4.8% |
| 7 | 49152 | 4.0% |
| 9 | 46450 | 3.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 5351 | |
| . | 1816 | 22.6% |
| ' | 882 | 11.0% |
Space Separator
| Value | Count | Frequency (%) |
| 772443 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 10192 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4472303 | |
| Common | 2013736 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 565930 | 12.7% |
| e | 377516 | 8.4% |
| i | 375627 | 8.4% |
| l | 292727 | 6.5% |
| r | 273757 | 6.1% |
| s | 268438 | 6.0% |
| t | 259143 | 5.8% |
| o | 251522 | 5.6% |
| n | 233816 | 5.2% |
| c | 157915 | 3.5% |
| Other values (40) | 1415912 |
Common
| Value | Count | Frequency (%) |
| 772443 | ||
| 1 | 309397 | |
| 2 | 272499 | 13.5% |
| 3 | 153382 | 7.6% |
| 4 | 127141 | 6.3% |
| 5 | 123115 | 6.1% |
| 6 | 83727 | 4.2% |
| 8 | 58189 | 2.9% |
| 7 | 49152 | 2.4% |
| 9 | 46450 | 2.3% |
| Other values (4) | 18241 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6486039 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 772443 | 11.9% | |
| a | 565930 | 8.7% |
| e | 377516 | 5.8% |
| i | 375627 | 5.8% |
| 1 | 309397 | 4.8% |
| l | 292727 | 4.5% |
| r | 273757 | 4.2% |
| 2 | 272499 | 4.2% |
| s | 268438 | 4.1% |
| t | 259143 | 4.0% |
| Other values (54) | 2718562 |
is_student
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.2 MiB |
| 0 | |
|---|---|
| 1 | 162 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 407684 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 407522 | |
| 1 | 162 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 407522 | |
| 1 | 162 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 407522 | |
| 1 | 162 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 407684 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 407522 | |
| 1 | 162 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 407684 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 407522 | |
| 1 | 162 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 407684 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 407522 | |
| 1 | 162 | < 0.1% |
lim_eng
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.2 MiB |
| 0 | |
|---|---|
| 1 | 7794 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 407684 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 399890 | |
| 1 | 7794 | 1.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 399890 | |
| 1 | 7794 | 1.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 399890 | |
| 1 | 7794 | 1.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 407684 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 399890 | |
| 1 | 7794 | 1.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 407684 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 399890 | |
| 1 | 7794 | 1.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 407684 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 399890 | |
| 1 | 7794 | 1.9% |
age
Real number (ℝ)
| Distinct | 102 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37.315492 |
| Minimum | 1 |
|---|---|
| Maximum | 120 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.2 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 20 |
| Q1 | 26 |
| median | 35 |
| Q3 | 46 |
| 95-th percentile | 60 |
| Maximum | 120 |
| Range | 119 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 13.43056 |
|---|---|
| Coefficient of variation (CV) | 0.35991916 |
| Kurtosis | -0.22271582 |
| Mean | 37.315492 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 0.5885595 |
| Sum | 15212929 |
| Variance | 180.37995 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 30 | 58428 | |
| 40 | 42673 | 10.5% |
| 25 | 40305 | 9.9% |
| 50 | 35877 | 8.8% |
| 35 | 33339 | 8.2% |
| 45 | 24881 | 6.1% |
| 60 | 20837 | 5.1% |
| 20 | 20273 | 5.0% |
| 55 | 16130 | 4.0% |
| 21 | 6938 | 1.7% |
| Other values (92) | 108003 |
| Value | Count | Frequency (%) |
| 1 | 12 | < 0.1% |
| 2 | 5 | < 0.1% |
| 3 | 3 | < 0.1% |
| 4 | 13 | < 0.1% |
| 5 | 38 | < 0.1% |
| 6 | 13 | < 0.1% |
| 7 | 37 | < 0.1% |
| 8 | 71 | < 0.1% |
| 9 | 43 | < 0.1% |
| 10 | 320 |
| Value | Count | Frequency (%) |
| 120 | 3 | < 0.1% |
| 116 | 1 | < 0.1% |
| 100 | 15 | |
| 99 | 18 | |
| 98 | 3 | < 0.1% |
| 97 | 4 | < 0.1% |
| 96 | 1 | < 0.1% |
| 95 | 20 | |
| 94 | 6 | < 0.1% |
| 93 | 7 | < 0.1% |
gender_words
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 90 |
| Missing (%) | < 0.1% |
| Memory size | 6.2 MiB |
| Male | |
|---|---|
| Female | |
| Transgender man/boy | 548 |
| Transgender woman/girl | 492 |
Length
| Max length | 22 |
|---|---|
| Median length | 4 |
| Mean length | 4.5758672 |
| Min length | 4 |
Characters and Unicode
| Total characters | 1865096 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Male |
|---|---|
| 2nd row | Female |
| 3rd row | Female |
| 4th row | Male |
| 5th row | Male |
Common Values
| Value | Count | Frequency (%) |
| Male | 297732 | |
| Female | 108822 | 26.7% |
| Transgender man/boy | 548 | 0.1% |
| Transgender woman/girl | 492 | 0.1% |
| (Missing) | 90 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| male | 297732 | |
| female | 108822 | 26.6% |
| transgender | 1040 | 0.3% |
| man/boy | 548 | 0.1% |
| woman/girl | 492 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 517456 | |
| a | 408634 | |
| l | 407046 | |
| M | 297732 | |
| m | 109862 | 5.9% |
| F | 108822 | 5.8% |
| n | 3120 | 0.2% |
| r | 2572 | 0.1% |
| g | 1532 | 0.1% |
| 1040 | 0.1% | |
| Other values (9) | 7280 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1455422 | |
| Uppercase Letter | 407594 | 21.9% |
| Space Separator | 1040 | 0.1% |
| Other Punctuation | 1040 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 517456 | |
| a | 408634 | |
| l | 407046 | |
| m | 109862 | 7.5% |
| n | 3120 | 0.2% |
| r | 2572 | 0.2% |
| g | 1532 | 0.1% |
| o | 1040 | 0.1% |
| s | 1040 | 0.1% |
| d | 1040 | 0.1% |
| Other values (4) | 2080 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 297732 | |
| F | 108822 | 26.7% |
| T | 1040 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| 1040 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 1040 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1863016 | |
| Common | 2080 | 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 517456 | |
| a | 408634 | |
| l | 407046 | |
| M | 297732 | |
| m | 109862 | 5.9% |
| F | 108822 | 5.8% |
| n | 3120 | 0.2% |
| r | 2572 | 0.1% |
| g | 1532 | 0.1% |
| o | 1040 | 0.1% |
| Other values (7) | 5200 | 0.3% |
Common
| Value | Count | Frequency (%) |
| 1040 | ||
| / | 1040 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1865096 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 517456 | |
| a | 408634 | |
| l | 407046 | |
| M | 297732 | |
| m | 109862 | 5.9% |
| F | 108822 | 5.8% |
| n | 3120 | 0.2% |
| r | 2572 | 0.1% |
| g | 1532 | 0.1% |
| 1040 | 0.1% | |
| Other values (9) | 7280 | 0.4% |
is_gendnc
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.2 MiB |
| 0 | |
|---|---|
| 1 | 177 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 407684 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 407507 | |
| 1 | 177 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 407507 | |
| 1 | 177 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 407507 | |
| 1 | 177 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 407684 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 407507 | |
| 1 | 177 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 407684 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 407507 | |
| 1 | 177 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 407684 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 407507 | |
| 1 | 177 | < 0.1% |
gender_code
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.2 MiB |
| 1 | |
|---|---|
| 2 | |
| 3 | 548 |
| 4 | 492 |
| 0 | 90 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 407684 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 297732 | |
| 2 | 108822 | 26.7% |
| 3 | 548 | 0.1% |
| 4 | 492 | 0.1% |
| 0 | 90 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 297732 | |
| 2 | 108822 | 26.7% |
| 3 | 548 | 0.1% |
| 4 | 492 | 0.1% |
| 0 | 90 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 297732 | |
| 2 | 108822 | 26.7% |
| 3 | 548 | 0.1% |
| 4 | 492 | 0.1% |
| 0 | 90 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 407684 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 297732 | |
| 2 | 108822 | 26.7% |
| 3 | 548 | 0.1% |
| 4 | 492 | 0.1% |
| 0 | 90 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 407684 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 297732 | |
| 2 | 108822 | 26.7% |
| 3 | 548 | 0.1% |
| 4 | 492 | 0.1% |
| 0 | 90 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 407684 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 297732 | |
| 2 | 108822 | 26.7% |
| 3 | 548 | 0.1% |
| 4 | 492 | 0.1% |
| 0 | 90 | < 0.1% |
gendnc_code
Categorical
CONSTANT  MISSING 
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 407507 |
| Missing (%) | > 99.9% |
| Memory size | 6.2 MiB |
| 5.0 |
|---|
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 531 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 5.0 |
|---|---|
| 2nd row | 5.0 |
| 3rd row | 5.0 |
| 4th row | 5.0 |
| 5th row | 5.0 |
Common Values
| Value | Count | Frequency (%) |
| 5.0 | 177 | < 0.1% |
| (Missing) | 407507 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 5.0 | 177 |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 177 | |
| . | 177 | |
| 0 | 177 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 354 | |
| Other Punctuation | 177 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 177 | |
| 0 | 177 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 177 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 531 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 177 | |
| . | 177 | |
| 0 | 177 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 531 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 177 | |
| . | 177 | |
| 0 | 177 |
lgbt
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.5 MiB |
| False | |
|---|---|
| True | 10724 |
| Value | Count | Frequency (%) |
| False | 396960 | |
| True | 10724 | 2.6% |
race
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.2 MiB |
| white | |
|---|---|
| hisp | |
| black | |
| asian | |
| nhopi | 3112 |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.7044696 |
| Min length | 4 |
Characters and Unicode
| Total characters | 1917937 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | hisp |
|---|---|
| 2nd row | white |
| 3rd row | white |
| 4th row | hisp |
| 5th row | black |
Common Values
| Value | Count | Frequency (%) |
| white | 170777 | |
| hisp | 119669 | |
| black | 82052 | |
| asian | 31260 | 7.7% |
| nhopi | 3112 | 0.8% |
| aian | 814 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| white | 170777 | |
| hisp | 119669 | |
| black | 82052 | |
| asian | 31260 | 7.7% |
| nhopi | 3112 | 0.8% |
| aian | 814 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 325632 | |
| h | 293558 | |
| w | 170777 | |
| t | 170777 | |
| e | 170777 | |
| s | 150929 | |
| a | 146200 | |
| p | 122781 | 6.4% |
| b | 82052 | 4.3% |
| l | 82052 | 4.3% |
| Other values (4) | 202402 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1917937 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 325632 | |
| h | 293558 | |
| w | 170777 | |
| t | 170777 | |
| e | 170777 | |
| s | 150929 | |
| a | 146200 | |
| p | 122781 | 6.4% |
| b | 82052 | 4.3% |
| l | 82052 | 4.3% |
| Other values (4) | 202402 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1917937 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 325632 | |
| h | 293558 | |
| w | 170777 | |
| t | 170777 | |
| e | 170777 | |
| s | 150929 | |
| a | 146200 | |
| p | 122781 | 6.4% |
| b | 82052 | 4.3% |
| l | 82052 | 4.3% |
| Other values (4) | 202402 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1917937 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 325632 | |
| h | 293558 | |
| w | 170777 | |
| t | 170777 | |
| e | 170777 | |
| s | 150929 | |
| a | 146200 | |
| p | 122781 | 6.4% |
| b | 82052 | 4.3% |
| l | 82052 | 4.3% |
| Other values (4) | 202402 |
disability
Categorical
HIGH CARDINALITY  IMBALANCE 
| Distinct | 134 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.2 MiB |
| None | |
|---|---|
| Mental health condition | 13848 |
| Other disability | 1776 |
| Intellectual or developmental disability, including dementia | 626 |
| Speech impairment or limited use of language | 558 |
| Other values (129) | 2085 |
Length
| Max length | 201 |
|---|---|
| Median length | 4 |
| Mean length | 5.1228329 |
| Min length | 4 |
Characters and Unicode
| Total characters | 2088497 |
|---|---|
| Distinct characters | 30 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 65 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | None |
|---|---|
| 2nd row | None |
| 3rd row | None |
| 4th row | None |
| 5th row | None |
Common Values
| Value | Count | Frequency (%) |
| None | 388791 | |
| Mental health condition | 13848 | 3.4% |
| Other disability | 1776 | 0.4% |
| Intellectual or developmental disability, including dementia | 626 | 0.2% |
| Speech impairment or limited use of language | 558 | 0.1% |
| Deafness or difficulty hearing | 461 | 0.1% |
| Intellectual or developmental disability, including dementia|Mental health condition | 285 | 0.1% |
| Blind or limited vision | 268 | 0.1% |
| Mental health condition|Intellectual or developmental disability, including dementia | 241 | 0.1% |
| Mental health condition|Other disability | 124 | < 0.1% |
| Other values (124) | 706 | 0.2% |
Length
| Value | Count | Frequency (%) |
| none | 388791 | |
| health | 14978 | 3.3% |
| condition | 14399 | 3.2% |
| mental | 14372 | 3.2% |
| disability | 3359 | 0.7% |
| or | 3355 | 0.7% |
| other | 1954 | 0.4% |
| developmental | 1387 | 0.3% |
| including | 1387 | 0.3% |
| limited | 1275 | 0.3% |
| Other values (50) | 10082 | 2.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 444578 | |
| e | 438417 | |
| o | 424782 | |
| N | 388791 | |
| t | 59149 | 2.8% |
| i | 52496 | 2.5% |
| 47655 | 2.3% | |
| l | 45125 | 2.2% |
| a | 41741 | 2.0% |
| h | 33735 | 1.6% |
| Other values (20) | 112028 | 5.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1628493 | |
| Uppercase Letter | 409323 | 19.6% |
| Space Separator | 47655 | 2.3% |
| Math Symbol | 1639 | 0.1% |
| Other Punctuation | 1387 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 444578 | |
| e | 438417 | |
| o | 424782 | |
| t | 59149 | 3.6% |
| i | 52496 | 3.2% |
| l | 45125 | 2.8% |
| a | 41741 | 2.6% |
| h | 33735 | 2.1% |
| d | 25090 | 1.5% |
| c | 19323 | 1.2% |
| Other values (10) | 44057 | 2.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 388791 | |
| M | 14978 | 3.7% |
| O | 2199 | 0.5% |
| I | 1387 | 0.3% |
| S | 878 | 0.2% |
| D | 693 | 0.2% |
| B | 397 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 47655 |
Math Symbol
| Value | Count | Frequency (%) |
| | | 1639 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1387 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2037816 | |
| Common | 50681 | 2.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 444578 | |
| e | 438417 | |
| o | 424782 | |
| N | 388791 | |
| t | 59149 | 2.9% |
| i | 52496 | 2.6% |
| l | 45125 | 2.2% |
| a | 41741 | 2.0% |
| h | 33735 | 1.7% |
| d | 25090 | 1.2% |
| Other values (17) | 83912 | 4.1% |
Common
| Value | Count | Frequency (%) |
| 47655 | ||
| | | 1639 | 3.2% |
| , | 1387 | 2.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2088497 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 444578 | |
| e | 438417 | |
| o | 424782 | |
| N | 388791 | |
| t | 59149 | 2.8% |
| i | 52496 | 2.5% |
| 47655 | 2.3% | |
| l | 45125 | 2.2% |
| a | 41741 | 2.0% |
| h | 33735 | 1.6% |
| Other values (20) | 112028 | 5.4% |
reason_words
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5 |
| Missing (%) | < 0.1% |
| Memory size | 6.2 MiB |
| Reasonable Suspicion | |
|---|---|
| Traffic Violation | |
| Investigation to determine whether the person was truant | 5358 |
| Known to be on Parole / Probation / PRCS / Mandatory Supervision | 5126 |
| Consensual Encounter resulting in a search | 4418 |
| Other values (3) | 3935 |
Length
| Max length | 113 |
|---|---|
| Median length | 20 |
| Mean length | 20.295608 |
| Min length | 17 |
Characters and Unicode
| Total characters | 8274093 |
|---|---|
| Distinct characters | 43 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Traffic Violation |
|---|---|
| 2nd row | Reasonable Suspicion |
| 3rd row | Reasonable Suspicion |
| 4th row | Reasonable Suspicion |
| 5th row | Reasonable Suspicion |
Common Values
| Value | Count | Frequency (%) |
| Reasonable Suspicion | 213781 | |
| Traffic Violation | 175061 | |
| Investigation to determine whether the person was truant | 5358 | 1.3% |
| Known to be on Parole / Probation / PRCS / Mandatory Supervision | 5126 | 1.3% |
| Consensual Encounter resulting in a search | 4418 | 1.1% |
| Knowledge of outstanding arrest warrant/wanted person | 3904 | 1.0% |
| Determine whether the student violated school policy | 27 | < 0.1% |
| Possible conduct warranting discipline under Education Code sections 48900, 48900.2, 48900.3, 48900.4 and 48900.7 | 4 | < 0.1% |
| (Missing) | 5 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| reasonable | 213781 | |
| suspicion | 213781 | |
| traffic | 175061 | |
| violation | 175061 | |
| 15378 | 1.6% | |
| to | 10484 | 1.1% |
| person | 9262 | 1.0% |
| determine | 5385 | 0.6% |
| whether | 5385 | 0.6% |
| the | 5385 | 0.6% |
| Other values (40) | 103274 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 997046 | |
| o | 859346 | 10.4% |
| a | 847079 | 10.2% |
| n | 710187 | 8.6% |
| 524558 | 6.3% | |
| e | 523232 | 6.3% |
| s | 478220 | 5.8% |
| l | 406797 | 4.9% |
| c | 397752 | 4.8% |
| f | 354026 | 4.3% |
| Other values (33) | 2175850 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6888154 | |
| Uppercase Letter | 841955 | 10.2% |
| Space Separator | 524558 | 6.3% |
| Other Punctuation | 19310 | 0.2% |
| Decimal Number | 116 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 997046 | |
| o | 859346 | |
| a | 847079 | |
| n | 710187 | |
| e | 523232 | |
| s | 478220 | |
| l | 406797 | 5.9% |
| c | 397752 | 5.8% |
| f | 354026 | 5.1% |
| t | 261837 | 3.8% |
| Other values (11) | 1052632 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 224033 | |
| R | 218907 | |
| V | 175061 | |
| T | 175061 | |
| P | 15382 | 1.8% |
| C | 9548 | 1.1% |
| K | 9030 | 1.1% |
| I | 5358 | 0.6% |
| M | 5126 | 0.6% |
| E | 4422 | 0.5% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 40 | |
| 4 | 24 | |
| 8 | 20 | |
| 9 | 20 | |
| 2 | 4 | 3.4% |
| 3 | 4 | 3.4% |
| 7 | 4 | 3.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 19282 | |
| . | 16 | 0.1% |
| , | 12 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 524558 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7730109 | |
| Common | 543984 | 6.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 997046 | |
| o | 859346 | |
| a | 847079 | |
| n | 710187 | 9.2% |
| e | 523232 | 6.8% |
| s | 478220 | 6.2% |
| l | 406797 | 5.3% |
| c | 397752 | 5.1% |
| f | 354026 | 4.6% |
| t | 261837 | 3.4% |
| Other values (22) | 1894587 |
Common
| Value | Count | Frequency (%) |
| 524558 | ||
| / | 19282 | 3.5% |
| 0 | 40 | < 0.1% |
| 4 | 24 | < 0.1% |
| 8 | 20 | < 0.1% |
| 9 | 20 | < 0.1% |
| . | 16 | < 0.1% |
| , | 12 | < 0.1% |
| 2 | 4 | < 0.1% |
| 3 | 4 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8274093 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 997046 | |
| o | 859346 | 10.4% |
| a | 847079 | 10.2% |
| n | 710187 | 8.6% |
| 524558 | 6.3% | |
| e | 523232 | 6.3% |
| s | 478220 | 5.8% |
| l | 406797 | 4.9% |
| c | 397752 | 4.8% |
| f | 354026 | 4.3% |
| Other values (33) | 2175850 |
reasonid
Real number (ℝ)
| Distinct | 1693 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 18844 |
| Missing (%) | 4.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 51661.962 |
| Minimum | 3 |
|---|---|
| Maximum | 99999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.2 MiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 22004 |
| Q1 | 41063 |
| median | 54153 |
| Q3 | 54655 |
| 95-th percentile | 99990 |
| Maximum | 99999 |
| Range | 99996 |
| Interquartile range (IQR) | 13592 |
Descriptive statistics
| Standard deviation | 17956.083 |
|---|---|
| Coefficient of variation (CV) | 0.34756875 |
| Kurtosis | 1.5025841 |
| Mean | 51661.962 |
| Median Absolute Deviation (MAD) | 2017 |
| Skewness | 0.460685 |
| Sum | 2.0088237 × 1010 |
| Variance | 3.2242093 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 65002 | 28446 | 7.0% |
| 32022 | 18215 | 4.5% |
| 99990 | 17159 | 4.2% |
| 32111 | 16823 | 4.1% |
| 54167 | 16209 | 4.0% |
| 65000 | 14655 | 3.6% |
| 54106 | 14147 | 3.5% |
| 54655 | 11917 | 2.9% |
| 41063 | 9946 | 2.4% |
| 54146 | 9512 | 2.3% |
| Other values (1683) | 231811 | |
| (Missing) | 18844 | 4.6% |
| Value | Count | Frequency (%) |
| 3 | 27 | |
| 3065 | 1 | < 0.1% |
| 3068 | 1 | < 0.1% |
| 4021 | 6 | < 0.1% |
| 4022 | 67 | |
| 4023 | 1 | < 0.1% |
| 4026 | 1 | < 0.1% |
| 4028 | 1 | < 0.1% |
| 4031 | 5 | < 0.1% |
| 4032 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 99999 | 5911 | 1.4% |
| 99990 | 17159 | |
| 89105 | 4 | < 0.1% |
| 89005 | 9 | < 0.1% |
| 66218 | 3 | < 0.1% |
| 66211 | 82 | < 0.1% |
| 66210 | 204 | 0.1% |
| 66208 | 1646 | 0.4% |
| 66207 | 169 | < 0.1% |
| 66206 | 6 | < 0.1% |
reason_text
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 1697 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 18844 |
| Missing (%) | 4.6% |
| Memory size | 6.2 MiB |
| 65002 ZZ - LOCAL ORDINANCE VIOL (I) 65002 | 28446 |
|---|---|
| 602 PC - TRESPASSING (M) 32022 | 18215 |
| 647(E) PC - DIS CON:LODGE W/O CONSENT (M) 32111 | 16823 |
| 22450(A) VC - FAIL STOP VEH:XWALK/ETC (I) 54167 | 16209 |
| 65000 ZZ - LOCAL ORDINANCE VIOL (M) 65000 | 14655 |
| Other values (1692) |
Length
| Max length | 56 |
|---|---|
| Median length | 53 |
| Mean length | 44.110012 |
| Min length | 24 |
Characters and Unicode
| Total characters | 17151737 |
|---|---|
| Distinct characters | 49 |
| Distinct categories | 9 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 445 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 27150(A) VC - INADEQUATE MUFFLERS (I) 54116 |
|---|---|
| 2nd row | 415(2) PC - LOUD/UNREASONABLE NOISE (I) 53130 |
| 3rd row | 647(F) PC - DISORD CONDUCT:ALCOHOL (M) 64005 |
| 4th row | 647(F) PC - DISORD CONDUCT:ALCOHOL (M) 64005 |
| 5th row | 602 PC - TRESPASSING (M) 32022 |
Common Values
| Value | Count | Frequency (%) |
| 65002 ZZ - LOCAL ORDINANCE VIOL (I) 65002 | 28446 | 7.0% |
| 602 PC - TRESPASSING (M) 32022 | 18215 | 4.5% |
| 647(E) PC - DIS CON:LODGE W/O CONSENT (M) 32111 | 16823 | 4.1% |
| 22450(A) VC - FAIL STOP VEH:XWALK/ETC (I) 54167 | 16209 | 4.0% |
| 65000 ZZ - LOCAL ORDINANCE VIOL (M) 65000 | 14655 | 3.6% |
| NA - XX ZZ - COMMUNITY CARETAKING (X) 99990 | 14557 | 3.6% |
| 22350 VC - UNSAFE SPEED:PREVAIL COND (I) 54106 | 14147 | 3.5% |
| 25620 BP - POSS OPEN ALCOHOL:PUBLIC (I) 41063 | 9946 | 2.4% |
| 23123.5 VC - NO HND HLD DEVICE W/DRIVE (I) 54655 | 9532 | 2.3% |
| 21461(A) VC - DRIVER FAIL OBEY SIGN/ETC (I) 54146 | 9512 | 2.3% |
| Other values (1687) | 236798 | |
| (Missing) | 18844 | 4.6% |
Length
| Value | Count | Frequency (%) |
| 412682 | 12.9% | |
| i | 218922 | 6.8% |
| vc | 183432 | 5.7% |
| m | 124213 | 3.9% |
| pc | 111743 | 3.5% |
| zz | 62689 | 2.0% |
| 65002 | 56892 | 1.8% |
| fail | 51302 | 1.6% |
| viol | 51009 | 1.6% |
| local | 43101 | 1.3% |
| Other values (5845) | 1894179 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2821324 | 16.4% | |
| I | 819904 | 4.8% |
| E | 777645 | 4.5% |
| C | 738813 | 4.3% |
| A | 689810 | 4.0% |
| ( | 631649 | 3.7% |
| ) | 631602 | 3.7% |
| O | 612034 | 3.6% |
| 0 | 601951 | 3.5% |
| 2 | 564425 | 3.3% |
| Other values (39) | 8262580 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 8716397 | |
| Decimal Number | 3603590 | |
| Space Separator | 2821324 | 16.4% |
| Open Punctuation | 631649 | 3.7% |
| Close Punctuation | 631602 | 3.7% |
| Dash Punctuation | 413250 | 2.4% |
| Other Punctuation | 332906 | 1.9% |
| Currency Symbol | 829 | < 0.1% |
| Math Symbol | 190 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 819904 | 9.4% |
| E | 777645 | 8.9% |
| C | 738813 | 8.5% |
| A | 689810 | 7.9% |
| O | 612034 | 7.0% |
| N | 548839 | 6.3% |
| L | 545179 | 6.3% |
| T | 438047 | 5.0% |
| S | 427035 | 4.9% |
| P | 415949 | 4.8% |
| Other values (16) | 2703142 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 601951 | |
| 2 | 564425 | |
| 5 | 561441 | |
| 4 | 463016 | |
| 1 | 439987 | |
| 6 | 334787 | |
| 3 | 279634 | |
| 9 | 172665 | 4.8% |
| 7 | 118941 | 3.3% |
| 8 | 66743 | 1.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 169870 | |
| : | 132445 | |
| . | 27167 | 8.2% |
| & | 3285 | 1.0% |
| ' | 95 | < 0.1% |
| " | 42 | < 0.1% |
| , | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 2821324 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 631649 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 631602 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 413250 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 829 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 190 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8716397 | |
| Common | 8435340 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 819904 | 9.4% |
| E | 777645 | 8.9% |
| C | 738813 | 8.5% |
| A | 689810 | 7.9% |
| O | 612034 | 7.0% |
| N | 548839 | 6.3% |
| L | 545179 | 6.3% |
| T | 438047 | 5.0% |
| S | 427035 | 4.9% |
| P | 415949 | 4.8% |
| Other values (16) | 2703142 |
Common
| Value | Count | Frequency (%) |
| 2821324 | ||
| ( | 631649 | 7.5% |
| ) | 631602 | 7.5% |
| 0 | 601951 | 7.1% |
| 2 | 564425 | 6.7% |
| 5 | 561441 | 6.7% |
| 4 | 463016 | 5.5% |
| 1 | 439987 | 5.2% |
| - | 413250 | 4.9% |
| 6 | 334787 | 4.0% |
| Other values (13) | 971908 | 11.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17151737 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2821324 | 16.4% | |
| I | 819904 | 4.8% |
| E | 777645 | 4.5% |
| C | 738813 | 4.3% |
| A | 689810 | 4.0% |
| ( | 631649 | 3.7% |
| ) | 631602 | 3.7% |
| O | 612034 | 3.6% |
| 0 | 601951 | 3.5% |
| 2 | 564425 | 3.3% |
| Other values (39) | 8262580 |
reason_detail
Categorical
HIGH CARDINALITY  IMBALANCE  MISSING 
| Distinct | 282 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 18838 |
| Missing (%) | 4.6% |
| Memory size | 6.2 MiB |
| Moving Violation | |
|---|---|
| Officer witnessed commission of a crime | |
| Matched suspect description | |
| Equipment Violation | |
| Other Reasonable Suspicion of a crime | |
| Other values (277) |
Length
| Max length | 232 |
|---|---|
| Median length | 210 |
| Mean length | 29.898466 |
| Min length | 16 |
Characters and Unicode
| Total characters | 11625899 |
|---|---|
| Distinct characters | 46 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 131 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Equipment Violation |
|---|---|
| 2nd row | Officer witnessed commission of a crime |
| 3rd row | Officer witnessed commission of a crime |
| 4th row | Officer witnessed commission of a crime |
| 5th row | Matched suspect description |
Common Values
| Value | Count | Frequency (%) |
| Moving Violation | 107984 | |
| Officer witnessed commission of a crime | 85183 | |
| Matched suspect description | 68932 | |
| Equipment Violation | 51210 | |
| Other Reasonable Suspicion of a crime | 36636 | 9.0% |
| Non-moving Violation, including Registration Violation | 15867 | 3.9% |
| Witness or Victim identification of Suspect at the scene | 8664 | 2.1% |
| Matched suspect description|Witness or Victim identification of Suspect at the scene | 2714 | 0.7% |
| Matched suspect description|Officer witnessed commission of a crime | 2175 | 0.5% |
| Witness or Victim identification of Suspect at the scene|Matched suspect description | 1522 | 0.4% |
| Other values (272) | 7959 | 2.0% |
| (Missing) | 18838 | 4.6% |
Length
| Value | Count | Frequency (%) |
| violation | 190928 | |
| of | 146809 | 9.5% |
| a | 130440 | 8.4% |
| crime | 126477 | 8.2% |
| moving | 107984 | 7.0% |
| suspect | 93364 | 6.0% |
| witnessed | 89822 | 5.8% |
| commission | 89822 | 5.8% |
| officer | 86627 | 5.6% |
| matched | 75197 | 4.9% |
| Other values (85) | 412092 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 1468469 | |
| 1160716 | 10.0% | |
| o | 1060373 | 9.1% |
| e | 913193 | 7.9% |
| n | 838158 | 7.2% |
| s | 755743 | 6.5% |
| t | 753249 | 6.5% |
| c | 671800 | 5.8% |
| a | 532953 | 4.6% |
| m | 392075 | 3.4% |
| Other values (36) | 3079170 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9702646 | |
| Space Separator | 1160716 | 10.0% |
| Uppercase Letter | 717984 | 6.2% |
| Other Punctuation | 15873 | 0.1% |
| Dash Punctuation | 15871 | 0.1% |
| Math Symbol | 12785 | 0.1% |
| Decimal Number | 24 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 1468469 | |
| o | 1060373 | |
| e | 913193 | |
| n | 838158 | |
| s | 755743 | 7.8% |
| t | 753249 | 7.8% |
| c | 671800 | 6.9% |
| a | 532953 | 5.5% |
| m | 392075 | 4.0% |
| r | 373189 | 3.8% |
| Other values (15) | 1943444 |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 205240 | |
| M | 187036 | |
| O | 129951 | |
| R | 55291 | 7.7% |
| S | 54783 | 7.6% |
| E | 51210 | 7.1% |
| N | 15867 | 2.2% |
| W | 14312 | 2.0% |
| A | 3251 | 0.5% |
| C | 705 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 8 | |
| 4 | 6 | |
| 8 | 4 | |
| 9 | 4 | |
| 7 | 2 | 8.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 15869 | |
| . | 4 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 1160716 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 15871 |
Math Symbol
| Value | Count | Frequency (%) |
| | | 12785 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10420630 | |
| Common | 1205269 | 10.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 1468469 | |
| o | 1060373 | 10.2% |
| e | 913193 | 8.8% |
| n | 838158 | 8.0% |
| s | 755743 | 7.3% |
| t | 753249 | 7.2% |
| c | 671800 | 6.4% |
| a | 532953 | 5.1% |
| m | 392075 | 3.8% |
| r | 373189 | 3.6% |
| Other values (26) | 2661428 |
Common
| Value | Count | Frequency (%) |
| 1160716 | ||
| - | 15871 | 1.3% |
| , | 15869 | 1.3% |
| | | 12785 | 1.1% |
| 0 | 8 | < 0.1% |
| 4 | 6 | < 0.1% |
| 8 | 4 | < 0.1% |
| 9 | 4 | < 0.1% |
| . | 4 | < 0.1% |
| 7 | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11625899 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 1468469 | |
| 1160716 | 10.0% | |
| o | 1060373 | 9.1% |
| e | 913193 | 7.9% |
| n | 838158 | 7.2% |
| s | 755743 | 6.5% |
| t | 753249 | 6.5% |
| c | 671800 | 5.8% |
| a | 532953 | 4.6% |
| m | 392075 | 3.4% |
| Other values (36) | 3079170 |
reason_exp
Categorical
| Distinct | 183583 |
|---|---|
| Distinct (%) | 45.0% |
| Missing | 82 |
| Missing (%) | < 0.1% |
| Memory size | 6.2 MiB |
| cell phone | 4819 |
|---|---|
| stop sign | 4721 |
| speeding | 4497 |
| SPEED | 4155 |
| encroachment | 3658 |
| Other values (183578) |
Length
| Max length | 250 |
|---|---|
| Median length | 235 |
| Mean length | 28.53013 |
| Min length | 2 |
Characters and Unicode
| Total characters | 11628938 |
|---|---|
| Distinct characters | 92 |
| Distinct categories | 12 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 155880 ? |
|---|---|
| Unique (%) | 38.2% |
Sample
| 1st row | LOUD EXHAUST |
|---|---|
| 2nd row | loud party |
| 3rd row | stumbling back and forth, unable to maintain balance |
| 4th row | fighting with security |
| 5th row | rc of male at vacant house |
Common Values
| Value | Count | Frequency (%) |
| cell phone | 4819 | 1.2% |
| stop sign | 4721 | 1.2% |
| speeding | 4497 | 1.1% |
| SPEED | 4155 | 1.0% |
| encroachment | 3658 | 0.9% |
| radio call | 3482 | 0.9% |
| speed | 3479 | 0.9% |
| STOP SIGN | 2446 | 0.6% |
| CELL PHONE | 2071 | 0.5% |
| ped stop | 2056 | 0.5% |
| Other values (183573) | 372218 |
Length
| Value | Count | Frequency (%) |
| in | 54094 | 2.7% |
| subject | 51302 | 2.6% |
| of | 50374 | 2.6% |
| on | 43020 | 2.2% |
| a | 40364 | 2.1% |
| was | 37897 | 1.9% |
| to | 35570 | 1.8% |
| stop | 35343 | 1.8% |
| call | 29540 | 1.5% |
| and | 27506 | 1.4% |
| Other values (28030) | 1562251 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1563291 | 13.4% | |
| e | 750025 | 6.4% |
| i | 566936 | 4.9% |
| t | 537857 | 4.6% |
| a | 530621 | 4.6% |
| n | 527560 | 4.5% |
| o | 497686 | 4.3% |
| s | 430617 | 3.7% |
| r | 398458 | 3.4% |
| l | 366458 | 3.2% |
| Other values (82) | 5459429 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6810897 | |
| Uppercase Letter | 2968066 | |
| Space Separator | 1563291 | 13.4% |
| Decimal Number | 177838 | 1.5% |
| Other Punctuation | 96107 | 0.8% |
| Dash Punctuation | 6337 | 0.1% |
| Open Punctuation | 3051 | < 0.1% |
| Close Punctuation | 3006 | < 0.1% |
| Math Symbol | 273 | < 0.1% |
| Currency Symbol | 62 | < 0.1% |
| Other values (2) | 10 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 750025 | 11.0% |
| i | 566936 | 8.3% |
| t | 537857 | 7.9% |
| a | 530621 | 7.8% |
| n | 527560 | 7.7% |
| o | 497686 | 7.3% |
| s | 430617 | 6.3% |
| r | 398458 | 5.9% |
| l | 366458 | 5.4% |
| d | 296457 | 4.4% |
| Other values (16) | 1908222 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 317034 | 10.7% |
| I | 242750 | 8.2% |
| N | 222256 | 7.5% |
| T | 221118 | 7.4% |
| S | 214555 | 7.2% |
| A | 214253 | 7.2% |
| O | 211596 | 7.1% |
| R | 174216 | 5.9% |
| L | 159276 | 5.4% |
| D | 140110 | 4.7% |
| Other values (16) | 850902 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 63525 | |
| , | 17132 | 17.8% |
| / | 10546 | 11.0% |
| ' | 2515 | 2.6% |
| & | 907 | 0.9% |
| " | 635 | 0.7% |
| ; | 263 | 0.3% |
| # | 170 | 0.2% |
| : | 156 | 0.2% |
| @ | 89 | 0.1% |
| Other values (5) | 169 | 0.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 43365 | |
| 1 | 38792 | |
| 0 | 30811 | |
| 4 | 19052 | |
| 2 | 14599 | 8.2% |
| 6 | 9982 | 5.6% |
| 3 | 6180 | 3.5% |
| 7 | 5266 | 3.0% |
| 8 | 5078 | 2.9% |
| 9 | 4713 | 2.7% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 211 | |
| > | 42 | 15.4% |
| = | 10 | 3.7% |
| < | 8 | 2.9% |
| ~ | 2 | 0.7% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 3004 | |
| [ | 47 | 1.5% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2983 | |
| ] | 23 | 0.8% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 4 | |
| ^ | 1 | 20.0% |
Space Separator
| Value | Count | Frequency (%) |
| 1563291 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6337 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 62 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9778963 | |
| Common | 1849975 | 15.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 750025 | 7.7% |
| i | 566936 | 5.8% |
| t | 537857 | 5.5% |
| a | 530621 | 5.4% |
| n | 527560 | 5.4% |
| o | 497686 | 5.1% |
| s | 430617 | 4.4% |
| r | 398458 | 4.1% |
| l | 366458 | 3.7% |
| E | 317034 | 3.2% |
| Other values (42) | 4855711 |
Common
| Value | Count | Frequency (%) |
| 1563291 | ||
| . | 63525 | 3.4% |
| 5 | 43365 | 2.3% |
| 1 | 38792 | 2.1% |
| 0 | 30811 | 1.7% |
| 4 | 19052 | 1.0% |
| , | 17132 | 0.9% |
| 2 | 14599 | 0.8% |
| / | 10546 | 0.6% |
| 6 | 9982 | 0.5% |
| Other values (30) | 38880 | 2.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11628938 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1563291 | 13.4% | |
| e | 750025 | 6.4% |
| i | 566936 | 4.9% |
| t | 537857 | 4.6% |
| a | 530621 | 4.6% |
| n | 527560 | 4.5% |
| o | 497686 | 4.3% |
| s | 430617 | 3.7% |
| r | 398458 | 3.4% |
| l | 366458 | 3.2% |
| Other values (82) | 5459429 |
search_basis
Categorical
HIGH CARDINALITY  IMBALANCE  MISSING 
| Distinct | 721 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 321160 |
| Missing (%) | 78.8% |
| Memory size | 6.2 MiB |
| Incident to arrest | |
|---|---|
| Condition of parole / probation/ PRCS / mandatory supervision | |
| Consent given | |
| Officer Safety/safety of others | |
| Vehicle inventory | 1963 |
| Other values (716) |
Length
| Max length | 182 |
|---|---|
| Median length | 174 |
| Mean length | 34.487506 |
| Min length | 13 |
Characters and Unicode
| Total characters | 2983997 |
|---|---|
| Distinct characters | 34 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 396 ? |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | Vehicle inventory |
|---|---|
| 2nd row | Incident to arrest |
| 3rd row | Incident to arrest |
| 4th row | Incident to arrest |
| 5th row | Incident to arrest |
Common Values
| Value | Count | Frequency (%) |
| Incident to arrest | 39048 | 9.6% |
| Condition of parole / probation/ PRCS / mandatory supervision | 23248 | 5.7% |
| Consent given | 5589 | 1.4% |
| Officer Safety/safety of others | 3961 | 1.0% |
| Vehicle inventory | 1963 | 0.5% |
| Condition of parole / probation/ PRCS / mandatory supervision|Incident to arrest | 1034 | 0.3% |
| Visible contraband | 938 | 0.2% |
| Incident to arrest|Officer Safety/safety of others | 911 | 0.2% |
| Incident to arrest|Condition of parole / probation/ PRCS / mandatory supervision | 715 | 0.2% |
| Consent given|Incident to arrest | 564 | 0.1% |
| Other values (711) | 8553 | 2.1% |
| (Missing) | 321160 |
Length
| Value | Count | Frequency (%) |
| 53566 | ||
| to | 45600 | |
| arrest | 42100 | |
| incident | 42093 | |
| of | 36452 | |
| parole | 26783 | 6.2% |
| probation | 26783 | 6.2% |
| prcs | 26783 | 6.2% |
| mandatory | 26783 | 6.2% |
| condition | 25187 | 5.8% |
| Other values (128) | 83147 |
Most occurring characters
| Value | Count | Frequency (%) |
| 348753 | ||
| o | 294376 | 9.9% |
| n | 268287 | 9.0% |
| t | 257125 | 8.6% |
| r | 222991 | 7.5% |
| e | 214760 | 7.2% |
| i | 209486 | 7.0% |
| a | 176466 | 5.9% |
| s | 129107 | 4.3% |
| d | 105784 | 3.5% |
| Other values (24) | 756862 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2321650 | |
| Space Separator | 348753 | 11.7% |
| Uppercase Letter | 213531 | 7.2% |
| Other Punctuation | 88097 | 3.0% |
| Math Symbol | 11966 | 0.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 294376 | |
| n | 268287 | |
| t | 257125 | |
| r | 222991 | |
| e | 214760 | |
| i | 209486 | |
| a | 176466 | |
| s | 129107 | 5.6% |
| d | 105784 | 4.6% |
| p | 83631 | 3.6% |
| Other values (12) | 359637 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 62715 | |
| I | 45600 | |
| S | 36333 | |
| R | 26783 | |
| P | 26783 | |
| O | 8261 | 3.9% |
| V | 4990 | 2.3% |
| E | 1652 | 0.8% |
| W | 414 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 348753 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 88097 |
Math Symbol
| Value | Count | Frequency (%) |
| | | 11966 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2535181 | |
| Common | 448816 | 15.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 294376 | |
| n | 268287 | |
| t | 257125 | |
| r | 222991 | 8.8% |
| e | 214760 | 8.5% |
| i | 209486 | 8.3% |
| a | 176466 | 7.0% |
| s | 129107 | 5.1% |
| d | 105784 | 4.2% |
| p | 83631 | 3.3% |
| Other values (21) | 573168 |
Common
| Value | Count | Frequency (%) |
| 348753 | ||
| / | 88097 | 19.6% |
| | | 11966 | 2.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2983997 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 348753 | ||
| o | 294376 | 9.9% |
| n | 268287 | 9.0% |
| t | 257125 | 8.6% |
| r | 222991 | 7.5% |
| e | 214760 | 7.2% |
| i | 209486 | 7.0% |
| a | 176466 | 5.9% |
| s | 129107 | 4.3% |
| d | 105784 | 3.5% |
| Other values (24) | 756862 |
search_basis_exp
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 28990 |
|---|---|
| Distinct (%) | 45.7% |
| Missing | 344258 |
| Missing (%) | 84.4% |
| Memory size | 6.2 MiB |
| incident to arrest | 2366 |
|---|---|
| search incident to arrest | 1541 |
| arrest | 1321 |
| arrested | 800 |
| Incident to arrest | 772 |
| Other values (28985) |
Length
| Max length | 250 |
|---|---|
| Median length | 236 |
| Mean length | 27.766011 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1761087 |
|---|---|
| Distinct characters | 88 |
| Distinct categories | 11 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 24904 ? |
|---|---|
| Unique (%) | 39.3% |
Sample
| 1st row | IMPOUNDED |
|---|---|
| 2nd row | search incident to arrest |
| 3rd row | search incident to arrest |
| 4th row | Male drunk in public |
| 5th row | 273.6 viloation of TRO |
Common Values
| Value | Count | Frequency (%) |
| incident to arrest | 2366 | 0.6% |
| search incident to arrest | 1541 | 0.4% |
| arrest | 1321 | 0.3% |
| arrested | 800 | 0.2% |
| Incident to arrest | 772 | 0.2% |
| INCIDENT TO ARREST | 652 | 0.2% |
| searched incident to arrest | 572 | 0.1% |
| consent | 550 | 0.1% |
| 5150 hold | 516 | 0.1% |
| consent search | 465 | 0.1% |
| Other values (28980) | 53871 | 13.2% |
| (Missing) | 344258 |
Length
| Value | Count | Frequency (%) |
| to | 20323 | 7.0% |
| arrest | 20146 | 7.0% |
| for | 13693 | 4.7% |
| incident | 12131 | 4.2% |
| search | 9435 | 3.3% |
| arrested | 9119 | 3.2% |
| subject | 8539 | 3.0% |
| was | 8139 | 2.8% |
| searched | 6252 | 2.2% |
| and | 5808 | 2.0% |
| Other values (7320) | 175627 |
Most occurring characters
| Value | Count | Frequency (%) |
| 226198 | 12.8% | |
| e | 138232 | 7.8% |
| r | 116668 | 6.6% |
| t | 104510 | 5.9% |
| a | 100275 | 5.7% |
| n | 85671 | 4.9% |
| s | 78939 | 4.5% |
| o | 76673 | 4.4% |
| i | 63628 | 3.6% |
| c | 59719 | 3.4% |
| Other values (78) | 710574 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1099455 | |
| Uppercase Letter | 359554 | 20.4% |
| Space Separator | 226198 | 12.8% |
| Decimal Number | 54950 | 3.1% |
| Other Punctuation | 14716 | 0.8% |
| Close Punctuation | 2745 | 0.2% |
| Open Punctuation | 2743 | 0.2% |
| Dash Punctuation | 699 | < 0.1% |
| Math Symbol | 14 | < 0.1% |
| Currency Symbol | 10 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 138232 | |
| r | 116668 | |
| t | 104510 | |
| a | 100275 | |
| n | 85671 | 7.8% |
| s | 78939 | 7.2% |
| o | 76673 | 7.0% |
| i | 63628 | 5.8% |
| c | 59719 | 5.4% |
| d | 58870 | 5.4% |
| Other values (16) | 216270 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 39943 | |
| R | 33124 | |
| A | 32432 | |
| T | 31278 | 8.7% |
| S | 30653 | 8.5% |
| N | 26150 | 7.3% |
| O | 23930 | 6.7% |
| I | 22840 | 6.4% |
| C | 20263 | 5.6% |
| D | 19228 | 5.3% |
| Other values (16) | 79713 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 9951 | |
| , | 2588 | 17.6% |
| / | 1104 | 7.5% |
| & | 597 | 4.1% |
| ' | 293 | 2.0% |
| : | 62 | 0.4% |
| ; | 53 | 0.4% |
| " | 40 | 0.3% |
| # | 15 | 0.1% |
| ? | 4 | < 0.1% |
| Other values (4) | 9 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 14267 | |
| 1 | 12107 | |
| 0 | 8044 | |
| 4 | 5518 | 10.0% |
| 2 | 4591 | 8.4% |
| 6 | 3142 | 5.7% |
| 7 | 2588 | 4.7% |
| 3 | 2167 | 3.9% |
| 9 | 1394 | 2.5% |
| 8 | 1132 | 2.1% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 6 | |
| = | 5 | |
| > | 2 | 14.3% |
| < | 1 | 7.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2744 | |
| ] | 1 | < 0.1% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ^ | 2 | |
| ` | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 226198 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2743 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 699 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 10 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1459009 | |
| Common | 302078 | 17.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 138232 | 9.5% |
| r | 116668 | 8.0% |
| t | 104510 | 7.2% |
| a | 100275 | 6.9% |
| n | 85671 | 5.9% |
| s | 78939 | 5.4% |
| o | 76673 | 5.3% |
| i | 63628 | 4.4% |
| c | 59719 | 4.1% |
| d | 58870 | 4.0% |
| Other values (42) | 575824 |
Common
| Value | Count | Frequency (%) |
| 226198 | ||
| 5 | 14267 | 4.7% |
| 1 | 12107 | 4.0% |
| . | 9951 | 3.3% |
| 0 | 8044 | 2.7% |
| 4 | 5518 | 1.8% |
| 2 | 4591 | 1.5% |
| 6 | 3142 | 1.0% |
| ) | 2744 | 0.9% |
| ( | 2743 | 0.9% |
| Other values (26) | 12773 | 4.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1761087 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 226198 | 12.8% | |
| e | 138232 | 7.8% |
| r | 116668 | 6.6% |
| t | 104510 | 5.9% |
| a | 100275 | 5.7% |
| n | 85671 | 4.9% |
| s | 78939 | 4.5% |
| o | 76673 | 4.4% |
| i | 63628 | 3.6% |
| c | 59719 | 3.4% |
| Other values (78) | 710574 |
seiz_basis
Categorical
| Distinct | 49 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 398568 |
| Missing (%) | 97.8% |
| Memory size | 6.2 MiB |
| Evidence | |
|---|---|
| Contraband | |
| Impound of vehicle | |
| Contraband|Evidence | |
| Evidence|Contraband | |
| Other values (44) |
Length
| Max length | 76 |
|---|---|
| Median length | 65 |
| Mean length | 15.307591 |
| Min length | 8 |
Characters and Unicode
| Total characters | 139544 |
|---|---|
| Distinct characters | 30 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 12 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Contraband |
|---|---|
| 2nd row | Evidence|Impound of vehicle |
| 3rd row | Contraband|Evidence |
| 4th row | Impound of vehicle |
| 5th row | Evidence |
Common Values
| Value | Count | Frequency (%) |
| Evidence | 2999 | 0.7% |
| Contraband | 2090 | 0.5% |
| Impound of vehicle | 1217 | 0.3% |
| Contraband|Evidence | 1049 | 0.3% |
| Evidence|Contraband | 551 | 0.1% |
| Safekeeping as allowed by law/statute | 387 | 0.1% |
| Evidence|Impound of vehicle | 237 | 0.1% |
| Contraband|Evidence|Impound of vehicle | 106 | < 0.1% |
| Abandoned property | 76 | < 0.1% |
| Contraband|Impound of vehicle | 57 | < 0.1% |
| Other values (39) | 347 | 0.1% |
| (Missing) | 398568 |
Length
| Value | Count | Frequency (%) |
| evidence | 2999 | |
| contraband | 2090 | |
| of | 1787 | |
| vehicle | 1662 | |
| impound | 1307 | |
| contraband|evidence | 1049 | 7.0% |
| as | 562 | 3.7% |
| allowed | 562 | 3.7% |
| by | 562 | 3.7% |
| evidence|contraband | 551 | 3.7% |
| Other values (38) | 1913 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 17040 | |
| n | 15843 | 11.4% |
| d | 11812 | 8.5% |
| a | 10977 | 7.9% |
| o | 8379 | 6.0% |
| i | 7575 | 5.4% |
| c | 7013 | 5.0% |
| v | 7011 | 5.0% |
| 5928 | 4.2% | |
| t | 5823 | 4.2% |
| Other values (20) | 42143 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 118754 | |
| Uppercase Letter | 11708 | 8.4% |
| Space Separator | 5928 | 4.2% |
| Math Symbol | 2592 | 1.9% |
| Other Punctuation | 562 | 0.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 17040 | |
| n | 15843 | |
| d | 11812 | |
| a | 10977 | |
| o | 8379 | 7.1% |
| i | 7575 | 6.4% |
| c | 7013 | 5.9% |
| v | 7011 | 5.9% |
| t | 5823 | 4.9% |
| b | 4697 | 4.0% |
| Other values (12) | 22584 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 5224 | |
| C | 4031 | |
| I | 1786 | 15.3% |
| S | 563 | 4.8% |
| A | 104 | 0.9% |
Space Separator
| Value | Count | Frequency (%) |
| 5928 |
Math Symbol
| Value | Count | Frequency (%) |
| | | 2592 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 562 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 130462 | |
| Common | 9082 | 6.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 17040 | |
| n | 15843 | |
| d | 11812 | 9.1% |
| a | 10977 | 8.4% |
| o | 8379 | 6.4% |
| i | 7575 | 5.8% |
| c | 7013 | 5.4% |
| v | 7011 | 5.4% |
| t | 5823 | 4.5% |
| E | 5224 | 4.0% |
| Other values (17) | 33765 |
Common
| Value | Count | Frequency (%) |
| 5928 | ||
| | | 2592 | |
| / | 562 | 6.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 139544 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 17040 | |
| n | 15843 | 11.4% |
| d | 11812 | 8.5% |
| a | 10977 | 7.9% |
| o | 8379 | 6.0% |
| i | 7575 | 5.4% |
| c | 7013 | 5.0% |
| v | 7011 | 5.0% |
| 5928 | 4.2% | |
| t | 5823 | 4.2% |
| Other values (20) | 42143 |
prop_type
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 490 |
|---|---|
| Distinct (%) | 5.4% |
| Missing | 398568 |
| Missing (%) | 97.8% |
| Memory size | 6.2 MiB |
| Drugs/narcotics | |
|---|---|
| Vehicle | |
| Drug Paraphernalia | |
| Drugs/narcotics|Drug Paraphernalia | |
| Other Contraband or evidence | |
| Other values (485) |
Length
| Max length | 202 |
|---|---|
| Median length | 151 |
| Mean length | 27.323058 |
| Min length | 5 |
Characters and Unicode
| Total characters | 249077 |
|---|---|
| Distinct characters | 35 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 259 ? |
|---|---|
| Unique (%) | 2.8% |
Sample
| 1st row | Drugs/narcotics |
|---|---|
| 2nd row | Alcohol |
| 3rd row | Drugs/narcotics|Money|Drug Paraphernalia |
| 4th row | Vehicle |
| 5th row | Firearm(s)|Ammunition|Cell phone(s) or electronic device(s) |
Common Values
| Value | Count | Frequency (%) |
| Drugs/narcotics | 1551 | 0.4% |
| Vehicle | 1194 | 0.3% |
| Drug Paraphernalia | 1115 | 0.3% |
| Drugs/narcotics|Drug Paraphernalia | 794 | 0.2% |
| Other Contraband or evidence | 586 | 0.1% |
| Weapon(s) other than a firearm | 509 | 0.1% |
| Alcohol | 483 | 0.1% |
| Drug Paraphernalia|Drugs/narcotics | 236 | 0.1% |
| Cell phone(s) or electronic device(s) | 201 | < 0.1% |
| Suspected Stolen property | 176 | < 0.1% |
| Other values (480) | 2271 | 0.6% |
| (Missing) | 398568 |
Length
| Value | Count | Frequency (%) |
| paraphernalia | 2089 | 8.8% |
| or | 2046 | 8.6% |
| drug | 1570 | 6.6% |
| drugs/narcotics | 1551 | 6.5% |
| other | 1518 | 6.4% |
| vehicle | 1194 | 5.0% |
| contraband | 1171 | 4.9% |
| drugs/narcotics|drug | 1057 | 4.4% |
| evidence | 1054 | 4.4% |
| a | 876 | 3.7% |
| Other values (233) | 9672 |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 26702 | 10.7% |
| a | 22214 | 8.9% |
| e | 22150 | 8.9% |
| n | 15726 | 6.3% |
| 14682 | 5.9% | |
| c | 14101 | 5.7% |
| o | 13723 | 5.5% |
| i | 13625 | 5.5% |
| s | 11269 | 4.5% |
| t | 10895 | 4.4% |
| Other values (25) | 83990 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 200742 | |
| Uppercase Letter | 18711 | 7.5% |
| Space Separator | 14682 | 5.9% |
| Math Symbol | 4834 | 1.9% |
| Other Punctuation | 3760 | 1.5% |
| Close Punctuation | 3174 | 1.3% |
| Open Punctuation | 3174 | 1.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 26702 | |
| a | 22214 | |
| e | 22150 | |
| n | 15726 | 7.8% |
| c | 14101 | 7.0% |
| o | 13723 | 6.8% |
| i | 13625 | 6.8% |
| s | 11269 | 5.6% |
| t | 10895 | 5.4% |
| h | 9023 | 4.5% |
| Other values (10) | 41314 |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 6775 | |
| P | 3015 | |
| C | 2046 | 10.9% |
| V | 1623 | 8.7% |
| O | 1171 | 6.3% |
| S | 1150 | 6.1% |
| A | 1028 | 5.5% |
| W | 876 | 4.7% |
| F | 548 | 2.9% |
| M | 479 | 2.6% |
Space Separator
| Value | Count | Frequency (%) |
| 14682 |
Math Symbol
| Value | Count | Frequency (%) |
| | | 4834 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 3760 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 3174 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 3174 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 219453 | |
| Common | 29624 | 11.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 26702 | |
| a | 22214 | 10.1% |
| e | 22150 | 10.1% |
| n | 15726 | 7.2% |
| c | 14101 | 6.4% |
| o | 13723 | 6.3% |
| i | 13625 | 6.2% |
| s | 11269 | 5.1% |
| t | 10895 | 5.0% |
| h | 9023 | 4.1% |
| Other values (20) | 60025 |
Common
| Value | Count | Frequency (%) |
| 14682 | ||
| | | 4834 | 16.3% |
| / | 3760 | 12.7% |
| ) | 3174 | 10.7% |
| ( | 3174 | 10.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 249077 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 26702 | 10.7% |
| a | 22214 | 8.9% |
| e | 22150 | 8.9% |
| n | 15726 | 6.3% |
| 14682 | 5.9% | |
| c | 14101 | 5.7% |
| o | 13723 | 5.5% |
| i | 13625 | 5.5% |
| s | 11269 | 4.5% |
| t | 10895 | 4.4% |
| Other values (25) | 83990 |
cont
Categorical
HIGH CARDINALITY  IMBALANCE 
| Distinct | 669 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 5 |
| Missing (%) | < 0.1% |
| Memory size | 6.2 MiB |
| None | |
|---|---|
| Alcohol | 11476 |
| Drugs/narcotics | 6526 |
| Drug Paraphernalia | 4943 |
| Drugs/narcotics|Drug Paraphernalia | 2538 |
| Other values (664) | 12470 |
Length
| Max length | 194 |
|---|---|
| Median length | 4 |
| Mean length | 5.6240277 |
| Min length | 4 |
Characters and Unicode
| Total characters | 2292798 |
|---|---|
| Distinct characters | 35 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 322 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | None |
|---|---|
| 2nd row | None |
| 3rd row | None |
| 4th row | None |
| 5th row | None |
Common Values
| Value | Count | Frequency (%) |
| None | 369726 | |
| Alcohol | 11476 | 2.8% |
| Drugs/narcotics | 6526 | 1.6% |
| Drug Paraphernalia | 4943 | 1.2% |
| Drugs/narcotics|Drug Paraphernalia | 2538 | 0.6% |
| Weapon(s) other than a firearm | 2222 | 0.5% |
| Other Contraband or evidence | 2068 | 0.5% |
| Drug Paraphernalia|Drugs/narcotics | 947 | 0.2% |
| Suspected Stolen property | 731 | 0.2% |
| Firearm(s) | 665 | 0.2% |
| Other values (659) | 5837 | 1.4% |
Length
| Value | Count | Frequency (%) |
| none | 369726 | |
| alcohol | 11476 | 2.5% |
| paraphernalia | 8064 | 1.8% |
| drugs/narcotics | 6526 | 1.4% |
| drug | 6396 | 1.4% |
| other | 5435 | 1.2% |
| or | 5160 | 1.1% |
| than | 3255 | 0.7% |
| a | 3255 | 0.7% |
| contraband | 3241 | 0.7% |
| Other values (301) | 29237 | 6.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 431619 | |
| e | 424236 | |
| n | 418639 | |
| N | 369726 | |
| r | 86923 | 3.8% |
| a | 75823 | 3.3% |
| c | 48344 | 2.1% |
| 44092 | 1.9% | |
| l | 42132 | 1.8% |
| i | 37630 | 1.6% |
| Other values (25) | 313634 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1771285 | |
| Uppercase Letter | 434995 | 19.0% |
| Space Separator | 44092 | 1.9% |
| Other Punctuation | 12790 | 0.6% |
| Math Symbol | 12012 | 0.5% |
| Open Punctuation | 8812 | 0.4% |
| Close Punctuation | 8812 | 0.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 431619 | |
| e | 424236 | |
| n | 418639 | |
| r | 86923 | 4.9% |
| a | 75823 | 4.3% |
| c | 48344 | 2.7% |
| l | 42132 | 2.4% |
| i | 37630 | 2.1% |
| s | 36002 | 2.0% |
| h | 34279 | 1.9% |
| Other values (10) | 135658 | 7.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 369726 | |
| D | 23243 | 5.3% |
| A | 13323 | 3.1% |
| P | 10453 | 2.4% |
| C | 5160 | 1.2% |
| W | 3255 | 0.7% |
| O | 3241 | 0.7% |
| S | 3220 | 0.7% |
| F | 1719 | 0.4% |
| M | 1655 | 0.4% |
Space Separator
| Value | Count | Frequency (%) |
| 44092 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 12790 |
Math Symbol
| Value | Count | Frequency (%) |
| | | 12012 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 8812 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 8812 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2206280 | |
| Common | 86518 | 3.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 431619 | |
| e | 424236 | |
| n | 418639 | |
| N | 369726 | |
| r | 86923 | 3.9% |
| a | 75823 | 3.4% |
| c | 48344 | 2.2% |
| l | 42132 | 1.9% |
| i | 37630 | 1.7% |
| s | 36002 | 1.6% |
| Other values (20) | 235206 |
Common
| Value | Count | Frequency (%) |
| 44092 | ||
| / | 12790 | 14.8% |
| | | 12012 | 13.9% |
| ( | 8812 | 10.2% |
| ) | 8812 | 10.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2292798 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 431619 | |
| e | 424236 | |
| n | 418639 | |
| N | 369726 | |
| r | 86923 | 3.8% |
| a | 75823 | 3.3% |
| c | 48344 | 2.1% |
| 44092 | 1.9% | |
| l | 42132 | 1.8% |
| i | 37630 | 1.6% |
| Other values (25) | 313634 |
actions
Categorical
HIGH CARDINALITY  IMBALANCE 
| Distinct | 11672 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 5 |
| Missing (%) | < 0.1% |
| Memory size | 6.2 MiB |
| None | |
|---|---|
| Curbside detention | 24321 |
| Handcuffed or flex cuffed | 16308 |
| Search of person was conducted|Handcuffed or flex cuffed | 9208 |
| Handcuffed or flex cuffed|Search of person was conducted | 9022 |
| Other values (11667) |
Length
| Max length | 360 |
|---|---|
| Median length | 4 |
| Mean length | 28.120136 |
| Min length | 4 |
Characters and Unicode
| Total characters | 11463989 |
|---|---|
| Distinct characters | 38 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 8059 ? |
|---|---|
| Unique (%) | 2.0% |
Sample
| 1st row | Search of property was conducted|Vehicle impounded |
|---|---|
| 2nd row | Curbside detention |
| 3rd row | Patrol car detention|Handcuffed or flex cuffed|Search of person was conducted |
| 4th row | Curbside detention|Handcuffed or flex cuffed|Search of person was conducted |
| 5th row | None |
Common Values
| Value | Count | Frequency (%) |
| None | 246838 | |
| Curbside detention | 24321 | 6.0% |
| Handcuffed or flex cuffed | 16308 | 4.0% |
| Search of person was conducted|Handcuffed or flex cuffed | 9208 | 2.3% |
| Handcuffed or flex cuffed|Search of person was conducted | 9022 | 2.2% |
| Patrol car detention|Handcuffed or flex cuffed | 3113 | 0.8% |
| Curbside detention|Handcuffed or flex cuffed | 2976 | 0.7% |
| Patrol car detention|Search of person was conducted|Handcuffed or flex cuffed | 2743 | 0.7% |
| Person photographed | 2531 | 0.6% |
| Search of person was conducted|Handcuffed or flex cuffed|Search of property was conducted | 2520 | 0.6% |
| Other values (11662) | 88099 | 21.6% |
Length
| Value | Count | Frequency (%) |
| none | 246838 | |
| was | 125241 | 7.8% |
| or | 117174 | 7.3% |
| of | 116125 | 7.3% |
| flex | 111013 | 6.9% |
| person | 90179 | 5.6% |
| cuffed | 51222 | 3.2% |
| conducted | 47480 | 3.0% |
| search | 45074 | 2.8% |
| handcuffed | 44529 | 2.8% |
| Other values (233) | 602459 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1462183 | |
| 1189655 | 10.4% | |
| o | 1051750 | 9.2% |
| n | 837630 | 7.3% |
| d | 819897 | 7.2% |
| r | 725015 | 6.3% |
| f | 707926 | 6.2% |
| c | 705365 | 6.2% |
| t | 485449 | 4.2% |
| a | 474788 | 4.1% |
| Other values (28) | 3004331 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9391409 | |
| Space Separator | 1189655 | 10.4% |
| Uppercase Letter | 648121 | 5.7% |
| Math Symbol | 234804 | 2.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1462183 | |
| o | 1051750 | |
| n | 837630 | |
| d | 819897 | |
| r | 725015 | |
| f | 707926 | |
| c | 705365 | |
| t | 485449 | 5.2% |
| a | 474788 | 5.1% |
| u | 406584 | 4.3% |
| Other values (15) | 1714822 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 246838 | |
| S | 116125 | |
| H | 111013 | |
| P | 81745 | 12.6% |
| C | 58772 | 9.1% |
| A | 16520 | 2.5% |
| V | 10181 | 1.6% |
| F | 6635 | 1.0% |
| E | 190 | < 0.1% |
| I | 55 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 1189655 |
Math Symbol
| Value | Count | Frequency (%) |
| | | 234804 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10039530 | |
| Common | 1424459 | 12.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1462183 | |
| o | 1051750 | |
| n | 837630 | 8.3% |
| d | 819897 | 8.2% |
| r | 725015 | 7.2% |
| f | 707926 | 7.1% |
| c | 705365 | 7.0% |
| t | 485449 | 4.8% |
| a | 474788 | 4.7% |
| u | 406584 | 4.0% |
| Other values (26) | 2362943 |
Common
| Value | Count | Frequency (%) |
| 1189655 | ||
| | | 234804 | 16.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11463989 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1462183 | |
| 1189655 | 10.4% | |
| o | 1051750 | 9.2% |
| n | 837630 | 7.3% |
| d | 819897 | 7.2% |
| r | 725015 | 6.3% |
| f | 707926 | 6.2% |
| c | 705365 | 6.2% |
| t | 485449 | 4.2% |
| a | 474788 | 4.1% |
| Other values (28) | 3004331 |
act_consent
Categorical
HIGH CARDINALITY  IMBALANCE  MISSING 
| Distinct | 335 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 297641 |
| Missing (%) | 73.0% |
| Memory size | 6.2 MiB |
| NA|NA | |
|---|---|
| NA|NA|NA | |
| NA|NA|NA|NA | |
| NA|NA|NA|NA|NA | |
| NA|NA|NA|NA|NA|NA | 2403 |
| Other values (330) |
Length
| Max length | 35 |
|---|---|
| Median length | 34 |
| Mean length | 8.251429 |
| Min length | 1 |
Characters and Unicode
| Total characters | 908012 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 95 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | NA|NA |
|---|---|
| 2nd row | NA|NA|NA |
| 3rd row | NA|NA|NA |
| 4th row | NA|NA |
| 5th row | NA|NA|NA |
Common Values
| Value | Count | Frequency (%) |
| NA|NA | 38272 | 9.4% |
| NA|NA|NA | 29881 | 7.3% |
| NA|NA|NA|NA | 17701 | 4.3% |
| NA|NA|NA|NA|NA | 7547 | 1.9% |
| NA|NA|NA|NA|NA|NA | 2403 | 0.6% |
| Y|NA | 1814 | 0.4% |
| NA|Y | 1112 | 0.3% |
| Y|NA|NA | 1095 | 0.3% |
| NA|Y|NA | 892 | 0.2% |
| NA|NA|NA|NA|NA|NA|NA | 827 | 0.2% |
| Other values (325) | 8499 | 2.1% |
| (Missing) | 297641 |
Length
| Value | Count | Frequency (%) |
| na|na | 38272 | |
| na|na|na | 29881 | |
| na|na|na|na | 17701 | |
| na|na|na|na|na | 7547 | 6.9% |
| na|na|na|na|na|na | 2403 | 2.2% |
| y|na | 1814 | 1.6% |
| na|y | 1112 | 1.0% |
| y|na|na | 1095 | 1.0% |
| na|y|na | 892 | 0.8% |
| na|na|na|na|na|na|na | 827 | 0.8% |
| Other values (325) | 8499 | 7.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 330813 | |
| A | 328361 | |
| | | 234804 | |
| Y | 14034 | 1.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 673208 | |
| Math Symbol | 234804 | 25.9% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 330813 | |
| A | 328361 | |
| Y | 14034 | 2.1% |
Math Symbol
| Value | Count | Frequency (%) |
| | | 234804 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 673208 | |
| Common | 234804 | 25.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 330813 | |
| A | 328361 | |
| Y | 14034 | 2.1% |
Common
| Value | Count | Frequency (%) |
| | | 234804 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 908012 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 330813 | |
| A | 328361 | |
| | | 234804 | |
| Y | 14034 | 1.5% |
| Unnamed: 0 | stop_id | pid | id | ori | agency | exp_years | date | time | dur | is_serv | assign_key | assign_words | inters | block | ldmk | street | hw_exit | is_school | school_name | city | beat | beat_name | is_student | lim_eng | age | gender_words | is_gendnc | gender_code | gendnc_code | lgbt | race | disability | reason_words | reasonid | reason_text | reason_detail | reason_exp | search_basis | search_basis_exp | seiz_basis | prop_type | cont | actions | act_consent | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 84362 | 1 | 84362_1 | CA0371100 | SD | 10 | 2019-01-01 | 00:15:07 | 30 | 0 | 1 | Patrol, traffic enforcement, field operations | NaN | 3500.0 | NaN | UNIVERSITY | NaN | 0 | NaN | SAN DIEGO | 839 | Cherokee Point 839 | 0 | 1 | 30 | Male | 0 | 1 | NaN | No | hisp | None | Traffic Violation | 54116.0 | 27150(A) VC - INADEQUATE MUFFLERS (I) 54116 | Equipment Violation | LOUD EXHAUST | Vehicle inventory | IMPOUNDED | NaN | NaN | None | Search of property was conducted|Vehicle impounded | NA|NA |
| 1 | 2 | 84364 | 1 | 84364_1 | CA0371100 | SD | 2 | 2019-01-01 | 00:15:16 | 10 | 0 | 1 | Patrol, traffic enforcement, field operations | NaN | 7500.0 | NaN | hillside dr | NaN | 0 | NaN | LA JOLLA | 124 | La Jolla 124 | 0 | 0 | 44 | Female | 0 | 2 | NaN | No | white | None | Reasonable Suspicion | 53130.0 | 415(2) PC - LOUD/UNREASONABLE NOISE (I) 53130 | Officer witnessed commission of a crime | loud party | NaN | NaN | NaN | NaN | None | Curbside detention | NaN |
| 2 | 3 | 84365 | 1 | 84365_1 | CA0371100 | SD | 1 | 2019-01-01 | 00:02:00 | 5 | 0 | 1 | Patrol, traffic enforcement, field operations | NaN | 1300.0 | NaN | ocean blvd | NaN | 0 | NaN | SAN DIEGO | 122 | Pacific Beach 122 | 0 | 0 | 30 | Female | 0 | 2 | NaN | No | white | None | Reasonable Suspicion | 64005.0 | 647(F) PC - DISORD CONDUCT:ALCOHOL (M) 64005 | Officer witnessed commission of a crime | stumbling back and forth, unable to maintain balance | Incident to arrest | search incident to arrest | NaN | NaN | None | Patrol car detention|Handcuffed or flex cuffed|Search of person was conducted | NA|NA|NA |
| 3 | 4 | 84366 | 1 | 84366_1 | CA0371100 | SD | 1 | 2019-01-01 | 00:38:00 | 5 | 0 | 1 | Patrol, traffic enforcement, field operations | NaN | 800.0 | NaN | garnet | NaN | 0 | NaN | SAN DIEGO | 122 | Pacific Beach 122 | 0 | 0 | 25 | Male | 0 | 1 | NaN | No | hisp | None | Reasonable Suspicion | 64005.0 | 647(F) PC - DISORD CONDUCT:ALCOHOL (M) 64005 | Officer witnessed commission of a crime | fighting with security | Incident to arrest | search incident to arrest | NaN | NaN | None | Curbside detention|Handcuffed or flex cuffed|Search of person was conducted | NA|NA|NA |
| 4 | 5 | 84369 | 1 | 84369_1 | CA0371100 | SD | 17 | 2019-01-01 | 01:06:41 | 2 | 1 | 1 | Patrol, traffic enforcement, field operations | NaN | 4400.0 | NaN | coronado | NaN | 0 | NaN | SAN DIEGO | 614 | Ocean Beach 614 | 0 | 1 | 40 | Male | 0 | 1 | NaN | No | black | None | Reasonable Suspicion | 32022.0 | 602 PC - TRESPASSING (M) 32022 | Matched suspect description | rc of male at vacant house | NaN | NaN | NaN | NaN | None | None | NaN |
| 5 | 6 | 84370 | 1 | 84370_1 | CA0371100 | SD | 1 | 2019-01-01 | 01:11:05 | 5 | 0 | 1 | Patrol, traffic enforcement, field operations | governor dr | NaN | NaN | radcliffe | NaN | 0 | NaN | SAN DIEGO | 115 | University City 115 | 0 | 0 | 75 | Female | 0 | 2 | NaN | No | white | None | Traffic Violation | 54110.0 | 24601 VC - FAIL MAINT LIC PLATE LAMP (I) 54110 | Equipment Violation | no license plate lights | NaN | NaN | NaN | NaN | None | None | NaN |
| 6 | 7 | 84371 | 1 | 84371_1 | CA0371100 | SD | 1 | 2019-01-01 | 01:15:56 | 60 | 0 | 1 | Patrol, traffic enforcement, field operations | la jolla village dr | NaN | NaN | villa la jolla dr | NaN | 0 | NaN | SAN DIEGO | 126 | Torrey Pines 126 | 0 | 0 | 45 | Male | 0 | 1 | NaN | No | white | None | Traffic Violation | 54056.0 | 20002 VC - HIT AND RUN (M) 54056 | Moving Violation | drive hit victim vehicle causing damage and minor injury and fled on foot | NaN | NaN | NaN | NaN | None | None | NaN |
| 7 | 8 | 84372 | 1 | 84372_1 | CA0371100 | SD | 2 | 2019-01-01 | 01:10:54 | 10 | 0 | 1 | Patrol, traffic enforcement, field operations | NaN | 1000.0 | NaN | pacific beach dr | NaN | 0 | NaN | SAN DIEGO | 122 | Pacific Beach 122 | 0 | 0 | 25 | Male | 0 | 1 | NaN | No | hisp | None | Reasonable Suspicion | 64005.0 | 647(F) PC - DISORD CONDUCT:ALCOHOL (M) 64005 | Officer witnessed commission of a crime | fell in street | NaN | NaN | NaN | NaN | None | Curbside detention | NaN |
| 8 | 9 | 84372 | 2 | 84372_2 | CA0371100 | SD | 2 | 2019-01-01 | 01:10:54 | 10 | 0 | 1 | Patrol, traffic enforcement, field operations | NaN | 1000.0 | NaN | pacific beach dr | NaN | 0 | NaN | SAN DIEGO | 122 | Pacific Beach 122 | 0 | 0 | 23 | Female | 0 | 2 | NaN | No | hisp | None | Reasonable Suspicion | 64005.0 | 647(F) PC - DISORD CONDUCT:ALCOHOL (M) 64005 | Officer witnessed commission of a crime | fell in street | NaN | NaN | NaN | NaN | None | Curbside detention | NaN |
| 9 | 10 | 84373 | 1 | 84373_1 | CA0371100 | SD | 9 | 2019-01-01 | 01:10:52 | 5 | 0 | 1 | Patrol, traffic enforcement, field operations | NaN | 300.0 | NaN | 5th Av | NaN | 0 | NaN | SAN DIEGO | 523 | Gaslamp 523 | 0 | 0 | 21 | Male | 0 | 1 | NaN | No | hisp | None | Reasonable Suspicion | 64005.0 | 647(F) PC - DISORD CONDUCT:ALCOHOL (M) 64005 | Officer witnessed commission of a crime | Male drunk in public unable to care for himself | Incident to arrest | Male drunk in public | NaN | NaN | None | Handcuffed or flex cuffed|Search of person was conducted | NA|NA |
| Unnamed: 0 | stop_id | pid | id | ori | agency | exp_years | date | time | dur | is_serv | assign_key | assign_words | inters | block | ldmk | street | hw_exit | is_school | school_name | city | beat | beat_name | is_student | lim_eng | age | gender_words | is_gendnc | gender_code | gendnc_code | lgbt | race | disability | reason_words | reasonid | reason_text | reason_detail | reason_exp | search_basis | search_basis_exp | seiz_basis | prop_type | cont | actions | act_consent | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 69804 | 69805 | 449687 | 3 | 449687_3 | CA0371100 | SD | 1 | 2021-06-30 | 15:35:00 | 60 | 0 | 1 | Patrol, traffic enforcement, field operations | NaN | 100.0 | NaN | W San Ysidro blvd | NaN | 0 | NaN | SAN YSIDRO | 712 | San Ysidro 712 | 0 | 0 | 60 | Male | 0 | 1 | NaN | No | white | None | Traffic Violation | 54431.0 | 24951(B) VC - TURN SIGNAL VIOLATION (I) 54431 | Moving Violation | pulled over vehicle for not using turn signal for a lane change and for the 3rd brakelight being out. | NaN | NaN | NaN | NaN | None | Person removed from vehicle by order | NaN |
| 69805 | 69806 | 449692 | 1 | 449692_1 | CA0371100 | SD | 1 | 2021-06-30 | 22:46:00 | 20 | 0 | 1 | Patrol, traffic enforcement, field operations | NaN | 4300.0 | NaN | University | NaN | 0 | NaN | SAN DIEGO | 832 | Teralta West 832 | 0 | 0 | 18 | Male | 0 | 1 | NaN | No | hisp | None | Traffic Violation | 54168.0 | 5204(A) VC - EXPIRED TABS/FAIL DISPLAY (I) 54168 | Non-moving Violation, including Registration Violation | displayed reg expired over 6 months | NaN | NaN | NaN | NaN | None | None | NaN |
| 69806 | 69807 | 449693 | 1 | 449693_1 | CA0371100 | SD | 1 | 2021-06-30 | 15:00:00 | 30 | 0 | 1 | Patrol, traffic enforcement, field operations | NaN | 200.0 | NaN | Via De San Ysidro | NaN | 0 | NaN | SAN YSIDRO | 712 | San Ysidro 712 | 0 | 0 | 30 | Male | 0 | 1 | NaN | No | hisp | None | Traffic Violation | 54649.0 | 24603(D) VC - STOPLAMPS:VEH 2 REQUIRED (I) 54649 | Equipment Violation | Subject had a brakeligh that was out. | Condition of parole / probation/ PRCS / mandatory supervision | NaN | NaN | NaN | None | Person removed from vehicle by order|Curbside detention|Search of property was conducted|Search of person was conducted | NA|NA|NA|NA |
| 69807 | 69808 | 449693 | 2 | 449693_2 | CA0371100 | SD | 1 | 2021-06-30 | 15:00:00 | 30 | 0 | 1 | Patrol, traffic enforcement, field operations | NaN | 200.0 | NaN | Via De San Ysidro | NaN | 0 | NaN | SAN YSIDRO | 712 | San Ysidro 712 | 0 | 0 | 30 | Female | 0 | 2 | NaN | No | hisp | None | Traffic Violation | 54649.0 | 24603(D) VC - STOPLAMPS:VEH 2 REQUIRED (I) 54649 | Equipment Violation | Subject had a brakelight that was out. | Condition of parole / probation/ PRCS / mandatory supervision | NaN | NaN | NaN | None | Search of property was conducted|Person removed from vehicle by order | NA|NA |
| 69808 | 69809 | 449694 | 1 | 449694_1 | CA0371100 | SD | 1 | 2021-06-30 | 23:30:00 | 5 | 0 | 1 | Patrol, traffic enforcement, field operations | NaN | 5600.0 | NaN | ECB | NaN | 0 | NaN | SAN DIEGO | 821 | Rolando 821 | 0 | 0 | 25 | Male | 0 | 1 | NaN | No | white | None | Traffic Violation | 54168.0 | 5204(A) VC - EXPIRED TABS/FAIL DISPLAY (I) 54168 | Non-moving Violation, including Registration Violation | expired reg over 6 months | NaN | NaN | NaN | NaN | None | None | NaN |
| 69809 | 69810 | 449701 | 1 | 449701_1 | CA0371100 | SD | 1 | 2021-06-30 | 21:36:15 | 5 | 0 | 1 | Patrol, traffic enforcement, field operations | NaN | 500.0 | NaN | Saturn | NaN | 0 | NaN | SAN DIEGO | 721 | Egger Highlands 721 | 0 | 0 | 50 | Male | 0 | 1 | NaN | No | black | None | Reasonable Suspicion | 53130.0 | 415(2) PC - LOUD/UNREASONABLE NOISE (I) 53130 | Matched suspect description | Radio call of large group of people at a vehicle making loud noise. | NaN | NaN | NaN | NaN | None | Curbside detention | NaN |
| 69810 | 69811 | 449709 | 1 | 449709_1 | CA0371100 | SD | 5 | 2021-06-30 | 23:29:46 | 10 | 0 | 1 | Patrol, traffic enforcement, field operations | NaN | 300.0 | NaN | 17th st | NaN | 0 | NaN | SAN DIEGO | 521 | East Village 521 | 0 | 0 | 40 | Male | 0 | 1 | NaN | No | white | None | Traffic Violation | 54427.0 | 21800(D) VC - FAIL STOP/YIELD:INOP SIGN (I) 54427 | Moving Violation | didnt stop at stop sign | NaN | NaN | NaN | NaN | None | None | NaN |
| 69811 | 69812 | 449716 | 1 | 449716_1 | CA0371100 | SD | 1 | 2021-06-30 | 23:45:00 | 20 | 0 | 1 | Patrol, traffic enforcement, field operations | NaN | 4200.0 | NaN | del sol ct | NaN | 0 | NaN | SAN DIEGO | 723 | Otay Mesa West 723 | 0 | 0 | 30 | Male | 0 | 1 | NaN | No | hisp | None | Reasonable Suspicion | 99999.0 | NA - XX AA - CODE NOT FOUND IN TABLE (X) 99999 | Matched suspect description | 415 - subj and later found to be in uncles driveway | NaN | NaN | NaN | NaN | None | Person removed from vehicle by order | NaN |
| 69812 | 69813 | 449726 | 1 | 449726_1 | CA0371100 | SD | 11 | 2021-06-30 | 15:54:00 | 12 | 0 | 1 | Patrol, traffic enforcement, field operations | 15 SOUTH / AERO DRIVE | NaN | NaN | NaN | NaN | 0 | NaN | SAN DIEGO | 313 | Kearney Mesa 313 | 0 | 0 | 40 | Female | 0 | 2 | NaN | No | hisp | None | Traffic Violation | 54566.0 | 23123(A) VC - USE CELLPH W/DRIV W/O HFD (I) 54566 | Moving Violation | CELL PHONE | NaN | NaN | NaN | NaN | None | None | NaN |
| 69813 | 69814 | 449933 | 1 | 449933_1 | CA0371100 | SD | 1 | 2021-06-30 | 17:45:00 | 120 | 0 | 1 | Patrol, traffic enforcement, field operations | NaN | 4200.0 | NaN | mISSION BLVD | NaN | 0 | NaN | SAN DIEGO | 122 | Pacific Beach 122 | 0 | 0 | 30 | Male | 0 | 1 | NaN | No | hisp | None | Reasonable Suspicion | 13219.0 | 245(A)(1) PC - ADW NOT FIREARM (F) 13219 | Matched suspect description | RADIO CALL REGARDING A MALE HITTING ANOTHER MALE WITH A CROWBAR | Incident to arrest | 245PC | NaN | NaN | None | Handcuffed or flex cuffed|Search of person was conducted|Curbside detention|Patrol car detention | NA|NA|NA|NA |